1/185 



1 GGATC C C AGGGAAC GT GAC C ATG GTC GTA GGG ATG ACT TGA C AGT T T CAAC GGGGT GC GAC CAC C GT T GC GC 72 
1 MVVGMT* 7 

73 T CAGAAGGC AT AC GT TGGT GGAAC AC GTC GGAAAGC T GGGAGGTGAAT C T G ATG GCT GGC GAC CAA GAG CTG 14 4 
1 > MAGDQEL 7 

145 GAA CTG CGG TTC GAC GTT CCT CTT TAC ACG CTT GCC GAG GCA TCG CG'j TAC CTG GTG GTT 204 
8E L R F DV P L Y T L AEAS R.j Y L V V 27 

205 CCC CGC GCC ACC CTG GCT ACG TGG GCT GAC GGC TAC GAG CGT CGG CCG GCC AAC GCA CCG 2 64 
28 P RAT. LATWAD GY E R R PANAP 47 

2 65 GCG GTC CAG GGG CAA CCG ATC GCC TTT GAC GCC TAT TCG GTC GCG CAG CTT TTT GGC GAC 324 
48AVQGQP IAF DAYSVAQLFGD 67 

325 GTC ACT GGT GCC CGC GTT GCG GGC GTC CAG CCG CAG CGA CAC CAC ATA CGG CCG GTC CGG 38 4 
68 VT GARVAGVQ PQRHH I RPVR 87 

385 TTG CGG GGG CCG TTG GGT GGG GTT GGG TGC CTC CGT CAC CCC AGG CAG TTC GCT GGC TAT 44 4 
88 L R G P LGGVGC LRHP RQFAGY 107 

4 45 TTG TCG CAG TAG CGCGACGGCATTGTCG ATG TCT TGG TAG CTAGCATCCGGTCGGGGGGCCGCTACCAGCG 515 
108 LSQ* MSW* 4 

516 CCAGCGCCGGGGCTCCCCGGTCCGGGTAGTGCGCGTCGAGTTGGTCGTGGACCAGCA ATG ACT GCG ACC CGG 58 7 
1 M T A T R 5 

588 CGA CTT CGA AAC CGC CAC CGG TTA GAT TCC CCG ACT GCG TCA TCG CCA GGT AAA CCG CCG 647 
6RLRNRHRLDS PTASSPGKPP 25 

648 GCA CTA ACG CCA GCA ACC AAC CCG TGA AGAC CAAC C AACGGC AC CT GCGCAGGT TGC GGC T CAAC CGC ATC 718 

26 ALTPATNP* 34 

719 ATG AAC TGC TGG ATT TCG GAC TCC CCG TAC TCT CGC GCA GTG CGT GCC CGC GAG CCT ACC 778 
1MNCWI S DS P YSRAVRARE PT 20 

779 GAA GAT CGC GTG CAT GCG TTC GGC GTG .GAC CGC ACA GCA CCT GGA GTT GGC GGC GCC GAG 838 
21 E DRVHAFGVD RTAP GVGGAE 40 

839 GGC CGA GAT GGC AGG ATG ACG GAT CGT CGG GGG CGG GAA CTC CCA GGC CGC CGG ACC GTC 898 
41GRDGRMTDRRGRELPGRRTV 60 

899 GCA AAC CCG TCG CAA ACC CGT CGC AAA CCG TAA GGAGT CATC C ATG AAG ACA GGC ACC GCG 959 
61ANPSQTRR KP* MKTGTA 6 

960 ACG ACG CGG CGC AGG CTG TTG GCA GTA CTG ATC GCC CTC GCG TTG CCG GGG GCC GCC GTT 1019 
7TTRRRLLAVL IALAL P GAAV 26 

1020 GCG CTG CTG GCC GAA CCA TCA GCG ACC GGC GCG TCG GAC CCG TGC GCG GCC AGC GAA GTG 107 9 

27 AL LAE P S AT GAS D P CAAS EV 46 

1080 GCG AGG ACG GTC GGT TCG GTC GCC AAG TCG ATG GGC GAC TAC CTG GAT TCA CAC CCA GAG 1139 
47ARTVG SVAKS MGDYLDSHP E 66 

1140 ACC AAC CAG GTG ATG ACC GCG GTC TTG CAG CAG CAG GTA GGG CCG GGG TCG GTC GCA TCG 1199 
67TNQVMTAVLQQQVGPGSVAS 86 

1200 CTG AAG GCC CAT TTC GAG GCG AAT CCC AAG GTC GCA TCG GAT CC 1243 
87 L KAH F EAN P KVAS D 100 
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FIGURE 1 
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Insert of the clone containing DP428 and contained in seql 
1/1 31/11 

GAT CGC CTT TGA CGC CTA TTC GGT CGC GCA GCT TTT TGG CGA CGT CAC TGG TGC CCG CGT 
asp arg leu OPA arg leu phe gly arg ala ala phe trp arg arg his trp cys pro arg 
61/21 91/31 

TGC GGG CGT CCA GCC GCA GCG ACA CCA CAT ACG GCC GGT CCG GTT GCG GGG GCC GTT GGG 
cys gly arg pro ala ala ala thr pro his thr ala gly pro val ala gly ala val gly 
121/41 151/51 

TGG GGT TGG GTG CCT CCG TCA CCC CAG GCA GTT CGC TGG CTA TTT GTC GCA GTA GCG CGA 
trp gly trp val pro pro ser pro gin ala val arg trp leu phe val ala val ala arg 
181/61 211/71 

CGG CAT TGT CGA TGT CTT GGT AGC TAG CAT CCG GTC GGG GGG CCG CTA CCA GCG CCA GCG 
arg his cys arg cys leu gly ser AMB his pro val gly gly pro leu pro ala pro ala 
241/81 271/91 

CCG GGG CTC CCC GGT CCG GGT AGT GCG CGT CGA GTT GGT CGT GGA CCA GCA ATG ACT GCG 
pro gly leu pro gly pro gly ser ala arg arg val gly arg gly pro ala met thr ala 
301/101 331/111 

ACC CGG CGA CTT CGA AAC CGC CAC CGG TTA GAT TCC CCG ACT GCG TCA TCG CCA GGT AAA 
thr arg arg leu arg asn arg his arg leu asp ser pro thr ala ser ser pro gly lys 
361/121 391/131 

CCG CCG GCA CTA ACG CCA GCA ACC AAC CCG TGA AGA CCA ACC AAC GGC ACC TGC GCA GGT 
pro pro ala leu thr pro ala thr asn pro OPA arg pro thr asn gly thr cys ala gly 
421/141 451/151 

TGC GGC TCA ACC GCA TCA TGA ACT GCT GGA TTT CGG ACT CCC CGT ACT CTC GCG CAG TGC . 
cys gly ser thr ala ser OPA thr ala gly phe arg thr pro arg thr leu ala gin cys 
481/161 511/171 

GTG CCC GCG AGC CTA CCG AAG ATC GCG TGC ATG CGT TCG GCG TGG ACC GCA CAG CAC CTG 
val pro ala ser leu pro lys ile ala cys met arg ser ala trp thr ala gin his leu 
541/181 571/191 

GAG TTG GCG GCG CCG AGG GCC GAG ATG GCA GGA TGA CGG ATC GTC GGG GGC GGG AAC TCC 
glu leu ala ala pro arg ala glu met ala gly OPA arg ile val gly gly gly asn ser 
601/201 631/211 

CAG GCC GCC GGA CCG TCG CAA ACC CGT CGC AAA CCC GTC GCA AAC CGT AAG GAG TCA TCC 
gin ala ala gly pro ser gin thr arg arg lys pro val ala asn arg lys glu ser ser 
661/221 691/231 

ATG AAG ACA GGC ACC GCG ACG ACG CGG CGC AGG CTG TTG GCA GTA CTG ATC GCC CTC GCG 
met lys thr gly thr ala thr thr arg arg arg leu leu ala val leu ile ala leu ala 
721/241 751/251 

TTG CCG GGG GCC GCC GTT GCG CTG CTG GCC GAA CCA TCA GCG ACC GGC GCG TCG GAC CCG 
leu pro gly ala ala val ala leu leu ala glu pro ser ala thr gly ala ser asp pro 
781/261 811/271 

TGC GCG GCC AGC GAA GTG GCG AGG ACG GTC GGT TCG GTC GCC AAG TCG ATG GGC GAC TAC 
cys ala ala ser glu val ala arg thr val gly ser val ala lys ser met gly asp tyr 
841/281 871/291 

CTG GAT TCA CAC CCA GAG ACC AAC CAG GTG ATG ACC GCG GTC TTG CAG CAG CAG GTA GGG 
leu asp ser his pro glu thr asn gin val met thr ala val leu gin gin gin val gly 
901/301 931/311 

CCG GGG TCG GTC GCA TCG CTG AAG GCC CAT TTC GAG GCG AAT CCC AAG GTC GCA TCG GAT C 
pro gly ser val ala ser leu lys ala his phe glu ala asn pro lys val ala ser asp 
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Insert of the, clone 


containing DP428, other reading 


frame 












2/1 


















32/11 


















ATC GCC 


TTT 


GAC 


GCC 


TAT 


TCG 


GTC 


GCG 


CAG 


CTT TTT 


GGC 


GAC 


GTC 


ACT 


GGT 


GCC 


CGC 


GTT 


ile ala 


phe 


asp 


ala 


tyr 


ser 


val 


ala 


gin 


leu phe 


gly 


asp 


val 


thr 


gly 


ala 


arg 


val 


62/21 
















92/31 


















GCG GGC 


GTC 


CAG 


CCG 


CAG 


CGA 


CAC 


CAC 


ATA 


CGG CCG 


GTC 


CGG 


TTG 


CGG 


GGG 


CCG 


TTG 


GGT 


ala gly 


val 


gin 


pro 


gin 


arg 


his 


his 


ile 


arg pro 


val 


arg 


leu 


arg 


gly pro 


leu 


gly 


122/41 


















152/51 


















GGG GTT 


GGG 


TGC 


CTC 


CGT 


CAC 


CCC 


AGG 


CAG 


TTC GCT 


GGC 


TAT 


TTG 


TCG 


CAG 


TAG 


CGC 


GAC 


gly val 


gly 


cys 


leu 


arg 


his 


pro 


arg 


gin 


phe ala 


gly 


tyr 


leu 


ser 


gin 


AMB 


arg 


asp 


182/61 


















212/71 


















GGC ATT 


GTC 


GAT 


GTC 


TTG 


GTA 


GCT 


AGC 


ATC 


CGG TCG 


GGG 


GGC 


CGC 


TAC 


CAG 


CGC 


CAG 


CGC 


gly ile 


val 


asp 


val 


leu 


val 


ala 


ser 


ile 


arg ser 


gly 


gly 


arg 


tyr 


gin 


arg 


gin 


arg 


242/81 
















272/91 


















CGG GGC 


TCC 


CCG 


GTC 


CGG 


GTA 


GTG 


CGC 


GTC 


GAG TTG 


GTC 


GTG 


GAC 


CAG 


CAA 


TGA 


CI 


CGA 


arg gly 


ser 


pro 


val 


arg 


val 


val 


arg 


val 


glu leu 


val 


val 


asp 


gin 


gin 


OPA 


leu 


a rg 


302/101 


















332/111 


















CCC GGC 


GAC 


TTC 


GAA 


ACC 


GCC 


ACC 


GGT 


TAG 


ATT CCC 


CGA 


CTG 


CGT 


CAT 


CGC 


CAG 


GTA 


AAC 


pro gly 


asp 


phe 


glu 


thr 


ala 


thr 


gly 


AMB 


ile pro 


arg 


leu 


arg 


nis 


a rg 


gin 


val 


asn 


362/121 


















392/ 131 


















CGC CGG 


CAC 


TAA 


CGC 


CAG 


CAA 


CCA 


ACC 


CGT 


GAA GAC 


CAA 


CCA 


ACG 




pom 




pat 


pqirn 
bl 1 


arg arg 


his 


OCH 


arg 


gin 


gin 


pro 


thr 


arg 


glu asp 


gin 


pro 


thr 


ala 


pro 


aia 


gin 


val 


422/141 


















452/151 


















GCG GCT 


CAA 


CCG 


CAT 


CAT 


GAA 


CTG 


CTG 


GAT 


TTC GGA 


CTC 


CCC 


GTA 


CTC 


TCG 


CGC 


AGT 


GCG 


ala ala 


gin 


pro 


his 


his 


glu 


leu 


leu 


asp 


phe gly 


leu 


pro 


val 


leu 


ser 


arg 


ser 


— 1-1 

ala 


482/161 














512/171 


















TGC CCG 


CGA 


GCC 


TAC 


CGA 


AGA 


TCG 


CGT 


GCA 


TGC GTT 


CGG 


CGT 


GGA 


CCG 


CAC 


AGC 


ACC 


TGG 


cys pro 


arg 


ala 


tyr 


arg 


arg 


ser 


arg 


ala 


cys val 


arg 


arg 


gly 


pro 


nis 


ser 


tnr 


t rp 


542/181 


















572/191 


















AGT TGG 


CGG 


CGC 


CGA 


GGG 


CCG 


AGA 


TGG 


CAG 


GAT GAC 


GGA 


TCG 


TCG 


GGG 


c 




ACT 


C 


ser trp 


arg 


arg 


arg 


gly pro 


arg 


trp 


gin 


asp asp 


gly 


ser 


ser 


giy 


ala 


Cfl V 


tnr 


pro 


602/201 


















632/211 


















AGG CCG 


CCG 


GAC 


CGT 


CGC 


AAA 


CCC 


GTC 


GCA 


AAC CCG 


TCG 


CAA 


ACC 


GTA 


AGG 


AGT 




LLA 


arg pro 


pro 


asp 


arg 


arg 


lys 


pro 


val 


ala 


asn pro 


ser 


gin 


tnr 


val 


arg 


ser 


his 


p r o 


662/221 
















692/231 


















TGA AG A 


CAG 


GCA 


CCG 


CGA 


CGA 


CGC 


GGC 


GCA 


GGC TGT 


TGG 


CAG 


TAC 




TCG 


CCC 






OPA arg 


gin 


ala 


pro 


arg 


arg 


arg 


gly 


a _L ct 


gxy cys 


trp 


gin 


t~ v r 


UFA 


ser 


pro 


ser 


arg 


722/241 


















752/251 


















TGC CGG 


GGG 


CCG 


CCG 


TTG 


CGC 


TGC 


TGG 


CCG 


AAC CAT 


CAG 


CGA 


CCG 


GCG 


CGT 


CGG 


ACC 


CGT 


cys arg 


gly 


pro 


pro 


leu 


arg 


cys 


trp 


pro 


asn his 


gin 


arg 


pro 


ala 


arg 


arg 


thr 


arg 


782/261 


















812/271 


















GCG CGG 


CCA 


GCG 


AAG 


TGG 


CGA 


GGA 


CGG 


TCG 


GTT CGG 


TCG 


CCA 


AGT 


CGA 


TGG 


GCG 


ACT 


ACC 


ala arg 


pro 


ala 


lys 


trp 


arg 


gly 


arg 


ser 


val arg 


ser 


pro 


ser 


arg 


trp 


ala 


thr 


thr 


842/281 


















872/291 


















TGG ATT 


CAC 


ACC 


CAG 


AGA 


CCA 


ACC 


AGG 


TGA 


TGA CCG 


CGG 


TCT 


TGC 


AGC 


AGC 


AGG 


TAG 


GGC 


trp ile 


his 


thr 


gin 


arg 


pro 


thr 


arg 


OPA 


OPA pro 


arg 


ser 


cys 


ser 


ser 


arg 


AMB 


gly 


902/301 


















932/311 


















CGG GGT 


CGG 


TCG 


CAT 


CGC 


TGA 


AGG 


CCC 


ATT 


TCG AGG 


CGA 


ATC 


CCA 


AGG 


TCG 


CAT 


CGG 


ATC 


arg gly 


arg 


ser 


his 


arg 


OPA 


arg 


pro 


ile 


ser arg 


arg 


ile 


pro 


arg 


ser 


his 


arg 


ile 
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SeqlC: Insert of the DP428 clone, other 
3/1 

TCG CCT TTG ACG CCT ATT CGG TCG CGC AGC 
ser pro leu thr pro ile arg ser arg ser 
63/21 

CGG GCG TCC AGC CGC AGC GAC ACC ACA TAC 
arg ala ser ser arg ser asp thr thr tyr 
123/41 

GGG TTG GGT GCC TCC GTC ACC CCA GGC AGT 
gly leu gly ala ser val thr pro gly ser 
183/61 

GCA TTG TCG ATG TCT TGG TAG CTA GCA TCC 
ala leu ser met ser trp AMB leu ala ser 
243/81 

GGG GCT CCC CGG TCC GGG TAG TGC GCG TCG 
gly ala pro arg ser gly AMB cys ala ser 
303/101 

CCG GCG ACT TCG AAA CCG CCA CCG GTT AGA 
pro ala thr ser lys pro pro pro val arg 
363/121 

GCC GGC ACT AAC GCC AGC AAC CAA CCC GTG 
ala gly thr asn ala ser asn gin pro val 
423/141 

CGG CTC AAC CGC ATC ATG AAC TGC TGG ATT 
arg leu asn arg ile met asn cys trp ile 
483/161 

GCC CGC GAG CCT ACC GAA GAT CGC GTG CAT 
ala arg glu pro thr glu asp arg val his 
543/181 

GTT GGC GGC GCC GAG GGC CGA GAT GGC AGG 
val gly gly ala glu gly arg asp gly arg 
603/201 

GGC CGC CGG ACC GTC GCA AAC CCG TCG CAA 
gly arg arg thr val ala asn pro ser gin 
663/221 

GAA GAC AGG CAC CGC GAC GAC GCG GCG CAG 
glu asp arg his arg asp asp ala ala gin 
723/241 

GCC GGG GGC CGC CGT TGC GCT GCT GGC CGA 
ala gly gly arg arg cys ala ala gly arg 
783/261 

CGC GGC CAG CGA AGT GGC GAG GAC GGT CGG 
arg gly gin arg ser gly glu asp gly arg 
843/281 

GGA TTC ACA CCC AGA GAC CAA CCA GGT GAT 
gly phe thr pro arg asp gin pro gly asp 
903/301 

GGG GTC GGT CGC ATC GCT GAA GGC CCA TTT 
gly val gly arg ile ala glu gly pro phe 



reading frame 
33/11 

TTT TTG GCG ACG TCA CTG GTG CCC GCG TTG 
phe leu ala thr ser leu val pro ala leu 
93/31 

GGC CGG TCC GGT TGC GGG GGC CGT TGG GTG 
gly arg ser gly cys gly gly arg trp val 
153/51 

TCG CTG GCT ATT TGT CGC AGT AGC GCG ACG 
ser leu ala ile cys arg ser ser ala thr 
213/71 

GGT CGG GGG GCC GCT ACC AGC GCC AGC GCC 
gly arg gly ala ala thr ser ala ser ala 
273/91 

AGT TGG TCG TGG ACC AGC AAT GAC TGC GAC 
ser trp ser trp thr ser asn asp cys asp 
333/111 

TTC CCC GAC TGC GTC ATC GCC AGG TAA ACC 
phe pro asp cys val ile ala arg OCH thr 
393/131 

AAG ACC AAC CAA CGG CAC CTG CGC AGG TTG 
lys thr asn gin arg his leu arg arg leu 
453/151 

TCG GAC TCC CCG TAC TCT CGC GCA GTG CGT 
ser asp ser pro tyr ser arg ala val arg 
513/171 

GCG TTC GGC GTG GAC CGC ACA GCA CCT GGA 
ala phe gly val asp arg thr ala pro gly 
573/191 

ATG ACG GAT CGT CGG GGG CGG GAA CTC CCA 
met thr asp arg arg gly arg glu leu pro 
633/211 

ACC CGT CGC AAA CCG TAA GGA GTC ATC CAT 
thr arg arg lys pro OCH gly val ile his 
693/231 

GCT GTT GGC AGT ACT GAT CGC CCT CGC GTT 
ala val gly ser thr asp arg pro arg val 
753/251 

ACC ATC AGC GAC CGG CGC GTC GGA CCC GTG 
thr ile ser asp arg arg val gly pro val 
813/271 

TTC GGT CGC CAA GTC GAT GGG CGA CTA CCT 
phe gly arg gin val asp gly arg leu pro 
873/291 

GAC CGC GGT CTT GCA GCA GCA GGT AGG GCC 
asp arg gly leu ala ala ala gly arg ala 
933/311 

CGA GGC GAA TCC CAA GGT CGC ATC GGA TC 
arg gly glu ser gin gly arg ile gly 



SEQ ID No.lC 
FIGURE 1C 



REPLACEMENT SHEET (RULE 26) 



# 
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Coding sequence DP428 identical to the Rv0203 predicted by Cole et al . 
(Nature 393:537-544) 

1/1 31/11 

ATG AAG ACA GGC ACC GCG ACG ACG CGG CGC AGG CTG TTG GCA GTA CTG ATC GCC CTC GCG 
Met lys thr gly thr ala thr thr arg arg arg leu leu ala val leu ile ala leu ala 
61/21 91/31 

TTG CCG GGG GCC GCC GTT GCG CTG CTG GCC GAA CCA TCA GCG ACC GGC GCG TCG GAC CCG 
leu pro gly ala ala val ala leu leu ala glu pro ser ala thr gly ala ser asp pro 
121/41 151/51 

TGC GCG GCC AGC GAA GTG GCG AGG ACG GTC GGT TCG GTC GCC AAG TCG ATG GGC GAC TAC 
cys ala ala ser glu val ala arg thr val gly ser val ala lys ser met gly asp tyr 
181/61 211/71 

CTG GAT TCA CAC CCA GAG ACC AAC CAG GTG ATG ACC GCG GTC TTG CAG CAG CAG GTA GGG 
leu asp ser his pro glu thr asn gin val met thr ala val leu gin gin gin val gly 
241/81 271/91 

CCG GGG TCG GTC GCA TCG CTG AAG GCC CAT TTC GAG GCG AAT CCC AAG GTC GCA TCG GAT 
pro gly ser val ala ser leu lys ala his phe glu ala asn pro lys val ala ser asp 
301/101 331/111 

CTG CAC GCG CTT TCG CAA CCG CTG ACC GAT CTT TCG ACT CGG TGC TCG CTG CCG ATC AGC 
leu his ala leu ser gin pro leu thr asp leu ser thr arg cys ser leu pro ile ser 
361/121 391/131 

GGC CTG CAG GCG ATC GGT TTG ATG CAG GCG GTG CAG GGC GCC CGC CGG TAG 
gly leu gin ala ile gly leu met gin ala val gin gly ala arg arg AMB 

SEQ ID No. ID 



FIGURE ID 



ORF containing the DP428 sequence and forming part of seqlA' 



1/1 


















31/11 


















TGA CGG 


ATC 


GTC 


GGG 


GGC 


GGG 


AAC 


TCC 


CAG 


GCC GCC 


GGA 


CCG 


TCG 


CAA 


ACC 


CGT 


CGC 


AAA 


OPA arg 


ile 


val 


gly 


gly 


gly 


asn 


ser 


gin 


ala ala 


gly 


pro 


ser 


gin 


thr 


arg 


arg 


lys 


61/21 
















91/31 


















CCC GTC 


GCA 


AAC 


CGT 


AAG 


GAG 


TCA 


TCC 


ATG 


AAG ACA 


GGC 


ACC 


GCG 


ACG 


ACG 


CGG 


CGC 


AGG 


pro val 


ala 


asn 


arg 


lys 


glu 


ser 


ser 


met 


lys thr 


gly 


thr 


ala 


thr 


thr 


arg 


arg 


arg 


121/41 


















151/51 


















CTG TTG 


GCA 


GTA 


CTG 


ATC 


GCC 


CTC 


GCG 


TTG 


CCG GGG 


GCC 


GCC 


GTT 


GCG 


CTG 


CTG 


GCC 


GAA 


leu leu 


ala 


val 


leu 


ile 


ala 


leu 


ala 


leu 


pro gly 


ala 


ala 


val 


ala 


leu 


leu 


ala 


glu 


181/61 


















211/71 


















CCA TCA 


GCG 


ACC 


GGC 


GCG 


TCG 


GAC 


CCG 


TGC 


GCG GCC 


AGC 


GAA 


GTG 


GCG 


AGG 


ACG 


GTC 


GGT 


pro ser 


ala 


thr 


gly 


ala 


ser 


asp 


pro 


cys 


ala ala 


ser 


glu 


val 


ala 


arg 


thr 


val 


gly 


241/81 


















271/91 


















TCG GTC 


GCC 


AAG 


TCG 


ATG 


GGC 


GAC 


TAC 


CTG 


GAT TCA 


CAC 


CCA 


GAG 


ACC 


AAC 


CAG 


GTG 


ATG 


ser val 


ala 


lys 


ser 


met 


gly 


asp 


tyr 


leu 


asp ser 


his 


pro 


glu 


thr 


asn 


gin 


val 


met 


301/101 
















331/111 


















ACC GCG 


GTC 


TTG 


CAG 


CAG 


CAG 


GTA 


GGG 


CCG 


GGG TCG 


GTC 


GCA 


TCG 


CTG 


AAG 


GCC 


CAT 


TTC 


thr ala 


val 


leu 


gin 


gin 


gin 


val 


gly 


pro 


gly ser 


val 


ala 


ser 


leu 


lys 


ala 


his 


phe 


361/121 


















391/131 
















CTT 


GAG GCG 


AAT 


CCC 


AAG 


GTC 


GCA 


TCG 


GAT 


CTG 


CAC GCG 


CTT 


TCG 


CAA 


CCG 


CTG 


ACC 


GAT 


glu ala 


asn 


pro 


lys 


val 


ala 


ser 


asp 


leu 


his ala 


leu 


ser 


gin 


pro 


leu 


thr 


asp 


leu 


421/141 
















451/151 
















GTG 


TCG ACT 


CGG 


TGC 


TCG 


CTG 


CCG 


ATC 


AGC 


GGC 


CTG CAG 


GCG 


ATC 


GGT 


TTG 


ATG 


CAG 


GCG 


ser thr 


arg 


cys 


ser 


leu 


pro 


ile 


ser 


gly 


leu gin 


ala 


ile 


gly 


leu 


met 


gin 


ala 


val 



481/161 

CAG GGC GCC CGC CGG TAG 
gin gly ala arg arg AMB 



SEQ ID No. IF 
FIGURE IF 



REPLACEMENT SHEET (RULE 26) 
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4 91 CCGGTCGGGGGGCCGCTACCAGCGCCAGCGCCGGGGCTCCCCGGTCCGGGTA GTG CGC GTC GAG TTG GTC GTG 563 
x VRVELVV7 

564 GAC CAG CAA TGA CTGCGACCCGGCGACTTCGAAACCGCCACCGGTTAGATTCCCCGACTGCGTCATCGCCAGGTAA 639 
8 D Q Q 

64 0 ACCGCCGGCACTAACGCCAGCAACCAACCC GTG AAG ACC AAC CAA CGG CAC CTG CGC AGG TTG CGG 7 05 
I VKTNQRHLRRLR12 



ins 
13 


L 


AAC 
N 


CGC 
R 


ATC 
I 


ATG 
M 


AAC 
N 


TGC 
C 


TGG 
W 


ATT 
I 


TCG 
S 


GAC 
D 


TCC 

S 


CCG 
P 


TAC 
Y 


TCT 
S 


CGC 
R 


GCA 
A 


GTG 
V 


CGT 
R 


GCC 
A 


765 

32 


766 
33 


CGC 
R 


GAG 
E 


CCT 
P 


ACC 
T 


GAA 
E 


GAT 
D 


CGC 
R 


GTG 
V 


CAT 
H 


GCG 
A 


TTC 
F 


GGC 
G 


GTG 
V 


GAC 
D 


CGC 
R 


ACA 
T 


GCA 
A 


CCT 
P 


GGA 
G 


GTT 
V 


825 
52 


826 
53 


GGC 
G 


GGC 
G 


GCC 
A 


GAG 
E 


GGC 
G 


CGA 
R 


GAT 
D 


GGC 
G 


AGG 
R 


ATG 
M 


ACG 
T 


GAT 
D 


CGT 
R 


CGG 
R 


GGG 
G 


CGG 
R 


GAA 
E 


CTC 
L 


CCA 
P 


GGC 
G 


885 
72 


886 
73 


CGC 
R 


CGG 
R 


ACC 
T 


GTC 
V 


GCA 
A 


AAC 
N 


CCG 
P 


TCG 

S 


CAA 
Q 


ACC 
T 


CGT 
R 


CGC 
R 


AAA 
K 


CCG 
P 


TAA GGAGTCATCC ATG AAG 
* xxxxxx M K 


946 
2 


947 
3 


ACA 
T 


GGC 
G 


ACC 
T 


GCG 
A 


ACG 
T 


ACG 
T 


CGG 
R 


CGC 
R 


AGG 
R 


CTG 
L 


TTG 
L 


GCA 
A 


GTA 
V 


CTG 
*L 


ATC 
I 


GCC 
A 


CTC 
L 


GCG 
A 


TTG 
L 


CCG 
P 


1006 
22 


1007 

23 


GGG 
G 


GCC 
A 


GCC 
A 


GTT 
V 


GCG 
A 


CTG 
L 


CTG 
L 


GCC 
A 


GAA 
E 


CCA 
P 


TCA 

S 


GCG 
A 


ACC 
T 


GGC 
G 


GCG 
A 


TCG 
S 


GAC 
D 


CCG 
P 


TGC 
C 


GCG 
A 


1066- 
42 


1067 
43 


GCC 
A 


AGC 
S 


GAA 
E 


GTG 
V 


GCG 
A 


AGG 
R 


ACG 
T 


GTC 
V 


GGT 
G 


TCG 'GTC 
S V 


GCC 
A 


AAG 
K 


TCG 
S 


ATG 
M 


GGC 
G 


GAC 
D 


TAC 

Y 


CTG 
L 


GAT 
D 


1126 
62 


1127 
63 


TCA 

S 


CAC 
H 


CCA 
P 


GAG 
E 


ACC 
T 


AAC 
N 


CAG 
Q 


GTG 
V 


ATG 
M 


ACC 
T 


GCG 
A 


GTC 
V 


TTG 
L 


CAG 
Q 


CAG 
Q 


CAG 
Q 


GTA 
V 


GGG 
G 


CCG 
P 


GGG 
G 


1186 
82 


1187 
83 


TCG 
S 


GTC 
V 


GCA 
A 


TCG 
S 


CTG 
L 


AAG 
K 


GCC 
A 


CAT 
H 


TTC 
F 


GAG 
E 


GCG 
A 


AAT 
N 


CCC 
P 


AAG 
K 


GTC 

V 


GCA 
A 


TCG 
S 


GAT 
D 


CTG 
L 


CAC 
H 


1246 

102 


1247 
103 


GCG 
A 


CTT 
L 


TCG 

S 


CAA 
Q 


CCG 
P 


CTG 
L 


ACC 
T 


GAT 
D 


CTT 
L 


TCG 
S 


ACT 
T 


CGG 
R 


TGC 
C 


TCG 

S 


CTG 
L 


CCG 
P 


ATC 
I 


AGC 
S 


GGC 
G 


CTG 
L 


1306 
122 


1307 
123 


CAG 
Q 


GCG 
,A 


ATC 
I 


GGT 
G 


TTG 
L 


ATG 
M 


CAG 
Q 


GCG 
A 


GTG 
V 


CAG 
Q 


GGC 
G 


GCC 
A 


CGC 
R 


CGG 
R 


TAG 
■*■ 


ATG CCG GAC CGC CGC 
M P D R R 


1366 
5 


1367 
6 


CGG 
R 


GTC 
V 


CGG 
R 


CGC 
R 


AGT 
S 


CGA 
R 


CGT 
R 


GAG 
E 


GCA 
A 


GCG 
A 


GTC 
V 


GCC 
A 


TAC 
Y 


CGG 
R 


GGC 
G 


GGT 
G 


GTC 
V 


TCG 
S 


CCG 
P 


CCT 
P 


1426 
25 


1427 
26 


TCT 
S 


GGT 
G 


CGC 
R 


AGG 
R 


TCA 
S 


GGG 
G 


GTC 
V 


GGC 
G 


GCT 
A 


GGA 
G 


CCT 
P 


TGC 
C 


GGT 
G 


GTG 
V 


GTT 
V 


TCG 
S 


ACC 
T 


GGG 
G 


TCG 

S 


TCG 
S 


1486 
45 


1487 
46 


CAG 
Q 


GGT 
G 


GTG 
V 


CCC 
P 


TGC 

C 


GGT 

G 


TGG 
W 


ATG 
M 


ACA 
T 


AGT 
S 


CGC 
R 


AGG 
R 


TTT 

F 


GGA 

G 


TCG 

S 


GTT 
V 


GGC 
G 


GGG 
G 


TCG 
S 


CGA 
R 


1546 
65 



1547 TCG TTG T 
66 S L 



SEQ ID No. 2 
FIGURE 2 



REPLACEMENT SHEET (RULE 26) 



7/185 



31/11 

TCG CCG GCT CGC GGA CGT AGA TAA TAG CTC ACC GTT GGA CGA CCT CGA CAG GGT CCT TTG 
ser pro ala arg gly arg arg OCH AMB leu thr val gly arg pro arg gin gly pro leu 
61/21 91/31 

TGA CTG CCG GGC TTG ACG CGG ACG ACC ACA GAG TCG GGT CAT CGC CTA AGG CTA CCG TTC 
OPA leu pro gly leu thr arg thr thr thr glu ser gly his arg leu arg leu pro phe 
121/41 151/51 

TGA CCT GGG GTG CGT GGG CGC CGA CGA GTG AGG CAG TCA TGT CTC AGG GCC CAC CGC CAC 
OPA pro gly val arg gly arg arg arg val arg gin ser cys leu arg ala his arg his 
181/61 211/71 

CTC GGT CGC CGG CAG TGT CAG CAT GTG CAG ATG ACT CCA CGC AGC TTG TTC GTG TTG GTG 
leu qly arq arq qln cys gin his val gin met thr pro arg ser leu phe val leu val 
241/81 271/91 

TCG TGG TTG CGA CGA CTT GGC GCT GGT GAG CGC ACC CGC CGG CGT CGT GCC GCG CAT GCG 
ser trp leu arg arg leu gly ala gly glu arg thr arg arg arg arg ala ala his ala 
301/101 
GAT C 



asp 



SEQ ID No,3A 
FIGURE 3A 



32/11 

CGC CGG CTC GCG GAC GTA GAT AAT AGC TCA CCG TTG GAC GAC CTC GAC AGG GTC CTT TGT 
arg arg leu ala asp val asp asn ser ser pro leu asp asp leu asp arg val leu cys 
62/21 92/31 

GAC TGC CGG GCT TGA CGC GGA CGA CCA CAG AGT CGG GTC ATC GCC TAA GGC TAC CGT TCT 
asp cys arg ala OPA arg gly arg pro gin ser arg val ile ala OCH gly tyr arg ser 
122/41 152/51 

GAC CTG GGG TGC GTG GGC GCC GAC GAG TGA GGC AGT CAT GTC TCA GGG CCC ACC GCC ACC 
asp leu gly cys val gly ala asp glu OPA gly ser his val ser gly pro thr ala thr 
182/61 212/71 

TCG GTC GCC GGC AGT GTC AGC ATG TGC AGA TGA CTC CAC GCA GCT TGT TCG TGT TGG TGT 
ser val ala gly ser val ser met cys arg OPA leu his ala ala cys ser cys trp cys 
242/81 272/91 

CGT GGT TGC GAC GAC TTG GCG CTG GTG AGC GCA CCC GCC GGC GTC GTG CCG CGC ATG CGG 

arg gly cys asp asp leu ala leu val ser ala pro ala gly val val pro arg met arg 

302/101 

ATC 

ile 



SEQ ID No.3B 
FIGURE 3B 



REPLACEMENT SHEET (RULE 26) 



m 
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33/11 


















GCC GGC 


TCG 


CGG 


ACG 


TAG 


ATA 


ATA 


GCT 


CAC 


CGT TGG 


ACG 


ACC 


TCG 


ACA 


GGG 


TCC 


TTT 


GTG 


cilci cjly 


SQL 


arg 


thr 


AMB 


ile 


ile 


ala 


his 


arg trp 


thr 


thr 


ser 


thr 


giy 


ser 


phe 


val 


63/21 


















93/31 


















ACT GCC 


GGG 


CTT 


GAC 


GCG 


GAC 


GAC 


CAC 


AGA 


GTC GGG 


TCA 


TCG 


CCT 


AAG 


GCT 


ACC 


GTT 


CTG 


thr ala 


gly 


leu 


asp 


ala 


asp 


asp 


his 


arg 


val gly 


ser 


ser 


pro 


lys 


ala 


thr 


val 


leu 


123/41 
















153/51 


















ACC TGG 


GGT 


GCG 


TGG 


GCG 


CCG 


ACG 


AGT 


GAG 


GCA GTC 


ATG 


TCT 


CAG 


GGC 


CCA 


CCG 


CCA 


CCT 


thr trp 


gly 


ala 


trp 


ala 


pro 


thr 


ser 


glu 


ala val 


met 


ser 


gin 


gly 


pro 


pro 


pro 


pro 


183/61 
















213/71 


















CGG TCG 


CCG 


GCA 


GTG 


TCA 


GCA 


TGT 


GCA 


GAT 


GAC TCC 


ACG 


CAG 


CTT 


GTT 


CGT 


GTT 


GGT 


GTC 


arg ser 


pro 


ala 


val 


ser 


ala 


cys 


ala 


asp 


asp ser 


thr 


gin 


leu 


val 


arg 


val 


gly 


val 


243/81 
















273/91 


















GTG GTT 


GCG 


ACG 


ACT 


TGG 


CGC 


TGG 


TGA 


GCG 


CAC CCG 


CCG 


GCG 


TCG 


TGC 


CGC 


GCA 


TGC 


GGA 


val val 


ala 


thr 


thr 


trp 


arg 


trp 


OPA 


ala 


his pro 


pro 


ala 


ser 


cys 


arg 


ala 


cys 


gly 



TC 



SEQ ID No.3C 
FIGURE 3C 



31/11 



CCA ATT 


TTC 


CTT 


CGC 


GCC 


GTG 


CAA 


TAC 


CAT 


CTG CAA 


GAC 


CAG 


CGA 


CGG 


CCC 


GTG 


GTT 


GCG 


pro ile 


phe 


leu 


arg 


ala 


val 


gin 


tyr 


his 


leu gin 


asp 


gin 


arg 


arg 


pro 


val 


val 


ala 


61/21 
















91/31 














TGA 




GTC GCG 


CAG 


CTT 


GCG 


GAA 


ACC 


GGG 


TAT 


GGA 


CCC TGC 


CGT 


ACC 


GTT 


GTT 


GCC 


ACT 


TGT 


val ala 


gin 


leu 


ala 


glu 


thr 


gly 


tyr 


gly 


pro cys 


arg 


thr 


val 


val 


ala 


thr 


OPA 


cys 


121/41 
















151/51 


















CGT CGC 


TCT 


CCA 


CCC 


GTC 


GGG 


GGG 


CGA 


AAG 


CCA TTC 


CGA 


CAC 


TGG 


GAT 


CCT 


CAA 


AAC 


GTC 


arg arg^ 


ser 


pro 


pro 


val 


gly 


gly 


arg 


lys 


pro phe 


arg 


his 


trp 


asp 


pro 


gin 


asn 


val 


181/61 


















211/71 


















GGC TGA 


GTG 


TCT 


GCA 


GGG 


CTC 


CGG 


GGA 


GCA 


GCC GAT 


CAT 


CAC 


CAT 


GTA 


CGA 


ACT 


GAA 


TAA 


gly OPA 


val 


ser 


ala 


gly 


leu 


arg 


gly 


ala 


ala asp 


his 


his 


his 


val 


arg 


thr 


glu 


OCH 


241/81 


















271/91 


















GTC CCC 


CGC 


GCG 


CGA 


CTT 


CCA 


GAC 


ATT 


TGT 


TGT GGT 


TTC 


GGT 


TGA 


GGC 


CGA 


GGC 


GAG 


GCT 


val pro 


arg 


ala 


arg 


leu 


pro 


asp 


ile 


cys 


cys gly 


phe 


gly 


OPA 


gly 


arg 


gly 


glu 


ala 


301/101 
















331/111 


















CAT TTC 


GCA 


GCA 


ACC 


GGT 


CTC 


CGG 


GTC 


GCA 


GCA TCG 


TTG 


CGG 


CGA 


TCG 


CGG 


CGC 


AGT 


CGT 


his phe 


ala 


ala 


thr 


gly 


leu 


arg 


val 


ala 


ala ser 


leu 


arg 


arg 


ser 


arg 


arg 


ser 


arg 


361/121 




































CGG ACG 


AGT 


CGT 


CGT 


CAA 


CGA 


CCA 


CGA 


TC 




















arg thr 


ser 


arg 


arg 


gin 


arg 


pro 


arg 























SEQ ID No-4A 
FIGURE 4A 
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CAA TTT TCC TTC GCG CCG TGC AAT ACC ATC 
gin phe ser phe ala pro cys asn thr ile 
62/21 

TCG CGC AGC TTG CGG AAA CCG GGT ATG GAC 
ser arg ser leu arg lys pro gly met asp 
122/41 

GTC GCT CTC CAC CCG TCG GGG GGC GAA AGC 
val ala leu his pro ser gly gly glu ser 
182/61 

GCT GAG TGT CTG CAG GGC TCC GGG GAG CAG 
ala glu cys leu gin gly ser gly glu gin 
242/81 

TCC CCC GCG CGC GAC TTC CAG ACA TTT GTT 
ser pro ala arg asp phe gin thr phe val 
302/101 

ATT TCG CAG CAA CCG GTC TCC GGG TCG CAG 
ile ser gin gin pro val ser gly ser gin 
362/121 

GGA CGA GTC GTC GTC AAC GAC CAC GAT C 
gly arg val val val asn asp his asp 



32/11 

TGC AAG ACC AGC GAC GGC CCG TGG TTG. CGG 

cys lys thr ser asp gly pro trp leu arg 
92/31 

CCT GCC GTA CCG TTG TTG CCA CTT GAT GTC 

pro ala val pro leu leu pro leu asp val 
152/51 

CAT TCC GAC ACT GGG ATC CTC AAA ACG TCG 

his ser asp thr gly ile leu lys thr ser 
-212/71 

CCG ATC ATC ACC ATG TAC GAA CTG AAT AAG 

pro ile ile thr met tyr glu leu asn lys 
272/91 

GTG GTT TCG GTT GAG GCC GAG GCG AGG CTC 

val val ser val glu ala glu ala arg leu 
332/111 

CAT CGT TGC GGC GAT CGC GGC GCA GTC GTC 

his arg cys gly asp arg gly ala val val 



SEQ ID No.4B 
FIGURE 4B 



AAT TTT CCT TCG CGC CGT GCA ATA CCA 
asn phe pro ser arg arg ala ile pro 
63/21 

CGC GCA GOT TGC GGA AAC CGG GTA TGG 
arg ala ala cys gly asn arg val trp 
123/41 

TCG CTC TCC ACC CGT CGG GGG GCG AAA 
ser leu ser thr arg arg gly ala lys 
183/61 

CTG AGT GTC TGC AGG GCT CCG GGG AGC 
leu ser val cys arg ala pro gly ser 
243/81 

CCC CCG CGC GCG ACT TCC AGA CAT TTG 
pro pro arg ala thr ser arg his leu 
303/101 

TTT CGC AGC AAC CGG TCT CCG GGT CGC 
phe arg ser asn arg ser pro gly arg 
363/121 

GAC GAG TCG TCG TCA ACG ACC ACG ATC 
asp glu ser ser ser thr thr thr ile 



33/11 

TCT GCA AGA CCA GCG ACG GCC CGT GGT TGC GGT 
ser ala arg pro ala thr ala arg gly cys gly 
93/31 

ACC CTG CCG TAC CGT TGT TGC CAC TTG ATG TCG 
thr leu pro tyr arg cys cys his leu met ser 
153/51 

GCC ATT CCG ACA CTG GGA TCC TCA AAA CGT CGG 
ala ile pro thr leu gly ser ser lys arg arg 
213/71 

AGC CGA TCA TCA CCA TGT ACG AAC TGA ATA AGT 
ser arg ser ser pro cys thr asn OPA ile ser 
273/91 

TTG TGG TTT CGG TTG AGG CCG AGG CGA GGC TCA 
leu trp phe arg leu arg pro arg arg gly ser 
333/111 

AGC ATC GTT GCG GCG ATC GCG GCG CAG TCG TCG 
ser ile val ala ala ile ala ala gin ser ser 



SEQ ID No.4C 

FIGURE 4C 
REPLACEMENT SHEET (RULE 26) 



m 
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part of the nucleotide sequence of seq4A 

1/1 31/11 

CCG CGC GCG ACT TCC » AGA CAT TTG TTG TGG TTT CGG TTG AGG CCG AGG CGA GGC TCA TTT 
pro arg ala thr ser arg his leu leu trp phe arg leu arg pro arg arg gly ser phe 
61/21 91/31 

CGC AGC AAG CGG TCT CCG GGT CGC AGC ATC GTT GCG GCG ATC GCG GCG CAG TCG TCG GAC 
arg ser lys arg ser pro gly arg ser ile val ala ala ile ala ala gin ser ser asp 
121/41 

GAG TCG TCG TCA ACG ACC ACG ATC 
glu ser ser ser thr thr thr ile 



SEQ ID No.4A f 
FIGURE 4A' 



1/1 31/11 

CGC GCG CGA CTT CCA GAC ATT TGT TGT GGT TTC GGT TGA GGC CGA GGC GAG GCT CAT TTC 
arg ala arg leu pro asp ile cys cys gly phe gly OPA gly arg gly glu ala his phe 
61/21 91/31 

GCA GCA AGC GGT. CTC CGG GTC GCA GCA TCG TTG CGG CGA TCG CGG CGC AGT CGT CGG ACG 
ala ala ser gly leu arg val ala ala ser leu arg arg ser arg arg ser arg arg thr 
121/41 

AGT CGT CGT CAA CGA CCA CGA TC 
ser arg arg gin arg pro arg 



SEQ ID NO-4B 1 
FIGURE 4B 1 



1/1 

GCC GCG CGC GAC TTC CAG ACA 
ala ala arg asp phe gin thr 
61/21 

TCG CAG CAA GCG GTC TCC GGG 
ser gin gin ala val ser gly 
121/41 

CGA GTC GTC GTC AAC GAC CAC 
arg val val val asn asp his 



31/11 

TTT GTT GTG GTT TCG GTT 
phe val val val ser val 
91/31 

TCG CAG CAT CGT TGC GGC 
ser gin his arg cys gly 

GAT C 
asp 



SEQ ID No.4C f 
FIGURE 4C f 



GAG GCC GAG GCG AGG CTC ATT 
glu ala glu ala arg leu ile 

GAT CGC GGC GCA GTC GTC GGA 
asp arg gly ala val val gly 
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ORF according to Cole et al . (Nature 393:537-544) and containing the sequence 
Seq 4A* 

1/1 31/11 

tga ata agt ccg ccg cgc gcg act tec aga cat ttg ttg tgg ttt egg ttg agg ccg agg 
OPA ile ser pro pro arg ala thr ser arg his leu leu trp phe arg leu arg pro arg 
61/21 91/31 

cga ggc tea ttt cgc age aag egg tct ccg ggt cgc age ate gtt gcg gcg ate gcg gcg 
arg gly ser phe arg ser lys arg ser pro gly arg ser ile val ala ala ile ala ala 
121/41 151/51 

cag teg teg gac gag teg teg tea acg ace acg ate teg aac teg acg ccc tec tgt teg 
gin ser ser asp glu ser ser ser thr thr thr ile ser asn ser thr pro ser cys ser 
181/61 211/71 

agg atg eta cgc aga cag cgc teg atg gtg gcg ccg ttg ttg tac ate ggg atg cac ace 
arg met leu arg arg gin arg ser met val ala pro leu leu tyr ile gly met his thr 
241/81 271/91 

gag ata age ggt ttc gee ggg ttc ace gat acc acg ctt gat gca tea cca ggc ace aca 
glu ile ser gly phe ala gly phe thr asp thr thr leu asp ala ser pro gly thr thr 
301/101 

tgg cga etc aga gac tag 
trp arg leu arg asp AMB 



SEQ ID No.4F 
FIGURE 4F 



sequence upstream of seq4A* 
1/1 

GCA ACC TAC CAG CAG AGC CAG 
ala thr tyr gin gin ser gin 



and fused with seq4A f 
31/11 

GGG CTC ACA GGA CCT AAA 
gly leu thr gly pro lys 



GGA GTA GCG CCC ATG GCT GAT 
gly val ala pro met ala asp 



C 

SEQ ID No. 4 J 



FIGURE 4J 



seq4J T in another reading frame 

1/1 31/11 

ACG CAA CCT ACC AGC AGA GCC AGG GGC TCA CAG GAC CTA AAG GAG TAG CGC CCA TGG CTG 

thr gin pro thr ser arg ala arg gly ser gin asp leu lys glu AMB arg pro trp leu 

61/21 

ATC 

ile 

SEQ ID N°4K 
FIGURE 4K 



seq 4 J 1 in the third reading frame 

1/1 31/11 

CGC AAC CTA CCA GCA GAG CCA GGG GCT CAC AGG ACC TAA AGG AGT AGC GCC CAT GGC TGA 

arg asn leu pro ala glu pro gly ala his arg thr OCH arg ser ser ala his gly OPA 

TC 

SEQ ID NO-4L 



FIGURE 4L 
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sequence Rv2050 predicted by Cole et al . (Nature 393:537-544) and containing 
seq4 J 



1/1 


















31/11 


















ATG GCT 


GAT 


CGT 


GTC 


CTG 


AGG 


GGC 


AGT 


CGC 


CTC GGA 


GCC 


GTG 


AGC 


TAT 


GAG 


ACC 


GAC 


CGC 


Met ala 


asp 


arg 


val 


leu 


arg 


gly 


ser 


arg 


leu gly 


ala 


val 


ser 


tyr 


glu 


thr 


asp 


arg 


61/21 














91/31 


















AAC CAC 


GAC 


CTG 


GCG 


CCG 


CGC 


CAG 


ATC 


GCG 


CGG TAC 


CGC 


ACC 


GAC 


AAC 


GGC 


GAG 


GAG 


TTC 


asn his 


asp 


leu 


ala 


pro 


arg 


gin 


ile 


ala 


arg tyr 


arg 


thr 


asp 


asn 


gly 


glu 


glu 


phe 


121/41 
















151/51 


















GAA GTC 


CCG 


TTC 


GCC 


GAT 


GAC 


GCC 


GAG 


ATC 


CCC GGC 


ACC 


TGG 


TTG 


TGC 


CGC 


AAC 


GGC 


ATG 


glu val 


pro 


phe 


ala 


asp 


asp 


ala 


glu 


ile 


pro gly 


thr 


trp 


leu 


cys 


arg 


asn 


gly 


met 


181/61 
















211/71 


















GAA GGC 


ACC 


CTG 


ATC 


GAG 


GGC 


GAC 


CTG 


CCC 


GAG CCG 


AAG 


AAG 


GTT 


AAG 


CCG 


CCC 


CGG 


ACG 


glu gly 


thr 


leu 


ile 


glu 


gly 


asp 


leu 


pro 


glu pro 


lys 


lys 


val 


lys 


pro 


pro 


arg 


thr 


241/81 


















271/91 


















CAC TGG 


GAC 


ATG 


CTG 


CTG 


GAG 


CGC 


CGT 


TCC 


ATC GAA 


GAA 


CTC 


GAA 


GAG 


TTA 


CTT 


AAG 


GAG 


his trp 


asp 


met 


leu 


leu 


glu 


arg 


arg 


ser 


ile glu 


glu 


leu 


glu 


glu 


leu 


leu 


lys 


glu 


301/101 
















331/111 


















CGC CTC 


GAG 


CTC 


ATT 


CGG 


TCA 


CGT 


CGG 


CGC 


GGC TGA 


















arg leu 


glu 


leu 


ile 


arg 


ser 


arg 


arg 


arg 


gly OPA 



















SEQ ID No.4M 
FIGURE 4M 

ORF according to Cole et al. (Nature 393:537-544) and containing the sequence 
Rv2050 



1/1 31/11 



TAG TCC 


GCC 


CGG 


GTG 


TCC 


GAT 


CCC 


GGT 


ATC 


ATT GAT 


GGT 


CGC 


GCC 


GCG 


CGC 


GTC 


GCG 


TGC 


AMB ser 


ala 


arg 


val 


ser 


asp 


pro 


gly 


ile 


ile asp 


gly 


arg 


ala 


ala 


arg 


val 


ala 


cys 


61/21 
















91/31 


















CGG GAA 


CTA 


CGC 


AGA 


CGG 


CCG 


CAG 


CGT 


TTG 


CCA ACC 


GGA 


GCC 


AGT 


CGC 


CAG 


TAC 


GCA 


ACC 


arg glu 


leu 


arg 


arg 


arg 


pro 


gin 


arg 


leu 


pro thr 


gly 


ala 


ser 


arg 


gin 


tyr 


ala' 


thr 


121/41 


















151/51 


















TAC CAG 


CAG 


AGC 


CCA 


GGG 


CTC 


ACA 


GGA 


CCT 


AAA GGA 


GTA 


GCG 


CCC 


ATG 


GCT 


GAT 


CGT 


GTC 


tyr gin 


gin 


ser 


pro 


gly 


leu 


thr 


gly 


pro 


lys L gly 


val 


ala 


pro 


met 


ala 


asp 


arg 


val 


181/61 
















211/71 


















CTG AGG 


GGC 


AGT 


CGC 


CTC 


GGA 


GCC 


GTG 


AGC 


TAT GAG 


ACC 


GAC 


CGC 


AAC 


CAC 


GAC 


CTG 


GCG 


leu arg 


gly 


ser 


arg 


leu 


gly 


ala 


val 


ser 


tyr glu 


thr 


asp 


arg 


asn 


his 


asp 


leu 


ala 


241/81 
















271/91 


















CCG CGC 


CAG 


ATC 


GCG 


CGG 


TAC 


CGC 


ACC 


GAC 


AAC GGC 


GAG 


GAG 


TTC 


GAA 


GTC 


CCG 


TTC 


GCC 


pro arg 


gin 


ile 


ala 


arg 


tyr 


arg 


thr 


asp 


asn gly 


glu 


glu 


phe 


glu 


val 


pro 


phe 


ala 


301/101 
















331/111 
















ATC 


GAT GAC 


GCC 


GAG 


ATC 


CCC 


GGC 


ACC 


TGG 


TTG 


TGC CGC 


AAC 


GGC 


ATG 


GAA 


GGC 


ACC 


CTG 


asp asp 


ala 


glu 


ile 


pro 


gly 


thr 


trp 


leu 


cys arg 


asn 


gly 


met 


glu 


gly 


thr 


leu 


ile 


361/121 
















391/131 
















CTG 


GAG GGC 


GAC 


CTG 


CCC 


GAG 


CCG 


AAG 


AAG 


GTT 


AAG CCG 


CCC 


CGG 


ACG 


CAC 


TGG 


GAC 


ATG 


glu gly 


asp 


leu 


pro 


glu 


pro 


lys 


lys 


val 


lys pro 


pro 


arg 


thr 


-his 


trp 


asp 


met 


leu 


421/141 


















451/151 
















ATT 


CTG GAG 


CGC 


CGT 


TCC 


ATC 


GAA 


GAA 


CTC 


GAA 


GAG TTA 


CTT 


AAG 


GAG 


CGC 


CTC 


GAG 


CTC 


leu glu 


arg 


arg 


ser 


ile 


glu 


glu 


leu 


glu 


glu leu 


leu 


lys 


glu 


arg 


leu 


glu 


leu 


ile 


481/161 




































CGG TCA 


CGT 


CGG 


CGC 


GGC 


TGA 


























arg ser 


arg 


arg 


arg 


gly 


OPA 



























SEQ ID No.4N 



FIGURE 4 



REPLACEMENT SHEET (RULE 2d) 




13/185 





















31/11 


















GAT CGC 


GGT 


CAA 


CGA 


GGC 


CGA 


ATA 


CGG 


CGA 


GAT GIG 


GGC 


CCA 


a r* a 


C_G(_ 


CG(- 




/-"AT 1 
GAi 


GTT 


asp arg 


gly 


gin 


arg 


gly 


arg 


ile 


arg 


arg 


asp val 


gly pro 


arg 


arg 


arg 


arg 


asp 


val 


61/21 


















91/31 


















TGG CTA 


CGC 


CGC 


GGC 


GAC 












GTT 


GCT 


GCC 


GTT 


CGA 


GGA 


GGC 


GCC 


trp leu 


arg 


arg 


gly 


asp 


gly 


asp 


gly 


asp 


gly asp 


val 


ala 


ala 


val 


arg 


gly 


gly 


ala 


121/41 


















151/51 


















GGA GAT 


GAC 


CAG 


CGC 


GGG 


TGG 


GCT 


CCT 


CGA 


GCA GGC 


CGC 


CGC 


GGT 


CGA 


GGA 


GGC 


CTC 


CGA 


gly asp 


asp 


gin 


arg 


gly 


trp 


ala 


pro 


arg 


ala gly 


arg 


arg 


gly 


arg 


gly 


gly 


leu 


arg 


181/61 


















211/71 


















CAC CGC 


CGC 


GGC 


GAA 


CCA 


GTT 


GAT 


GAA 


CAA 


TGT GCC 


CCA 


GGC 


GCT 


GCA 


ACA 


GCT 


GGC 


CCA 


his arg 


arg 


gly 


glu 


pro 


val 


asp 


glu 


gin 


cys ala 


pro 


gly 


ala 


ala 


thr 


ala 


gly 


pro 


241/81 














271/91 


















GCC CAC 


GCA 


GGG 


CAC 


CAC 


GCC 


TTC 


TTC 


CAA 


GCT GGG 


TGG 


CCT 


GTG 


GAA 


GAC 


GGT 


CTC 


GCC 


ala his 


ala 


gly 


his 


his 


ala 


phe 


phe 


gin 


ala gly 


trp 


pro 


val 


glu 


asp 


gly 


leu 


ala 



301/101 

GCA TCG GTC GCC GAT C 
ala ser val ala asp 



SEQ ID No.5A 
FIGURE 5A 



32/11 



ATC GCG 


GTC 


AAC 


GAG 


GCC 


GAA 


TAC 


GGC 


GAG 


ATG TGG 


GCC 


CAA 


GAC 


GCC 


GCC 


GCG 


ATG 


TTT 


ile ala 


val 


asn 


glu 


ala 


glu 


tyr 


gly 


glu 


met trp 


ala 


gin 


asp 


ala 


ala 


ala 


met 


phe 


62/21 


















92/31 


















GGC TAC 


GCC 


GCG 


GCG 


ACG 


GCG 


ACG 


GCG 


ACG 


GCG ACG 


TTG 


CTG 


CCG 


TTC 


GAG 


GAG 


GCG 


CCG 


gly tyr 


ala 


ala 


ala 


thr 


ala 


thr 


ala 


thr 


ala thr 


leu 


leu 


pro 


phe 


glu 


glu 


ala 


pro 


122/41 


















152/51 


















GAG ATG 


ACC 


AGC 


GCG 


GGT 


GGG 


CTC 


CTC 


GAG 


CAG GCC 


GCC 


GCG 


GTC 


GAG 


GAG 


GCC 


TCC 


GAC 


glu met 


thr 


ser 


ala 


gly 


gly 


leu 


leu 


glu 


gin ala 


ala 


ala 


val 


glu 


glu 


ala 


ser 


asp 


182/61 


















212/71 


















ACC GCC 


GCG 


GCG 


AAC 


CAG 


TTG 


ATG 


AAC 


AAT 


GTG CCC 


CAG 


GCG 


CTG 


CAA 


CAG 


CTG 


GCC 


CAG 


thr ala 


ala 


ala 


asn 


gin 


leu 


met 


asn 


asn 


val pro 


gin 


ala 


leu 


gin 


gin 


leu 


ala 


gin 


242/81 
















272/91 


















CCC ACG 


CAG 


GGC 


ACC 


ACG 


CCT 


TCT 


TCC 


AAG 


CTG GGT 


GGC 


CTG 


TGG 


AAG 


,ACG 


GTC 


TCG 


CCG 


pro thr 


gin 


gly 


thr 


thr 


pro 


ser 


ser 


lys 


leu gly 


gly 


leu 


trp 


lys 


thr 


val 


ser 


pro 



302/101 

CAT CGG TCG CCG ATC 
his arg ser pro ile 



SEQ ID No.5B 
FIGURE 5B 



REPLACEMENT SHEET (RULE 26) 



14/185 



33/11 



TCG CGG 


TCA 


ACG 


AGG 


CCG 


AAT 


ACG 


GCG 


AGA 


TGT GGG 


CCC 


AAG 


ACG 


CCG 


CCG 


CGA 


TGT 


TTG 


ser arg 


ser 


thr 


arg 


pro 


asn 


thr 


ala 


arg 


cys gly 


pro 


lys 


thr 


pro 


pro 


arg 


cys 


leu 


63/21 


















93/ 31 


















GCT ACG 


CCG 


CGG 


CGA 


CGG 


CGA 


CGG 


CGA 


CGG 


CGA CGT 


TGC 


TGC 


CGT 


TCG 


AGG 


AGG 


CGC 


CGG 


ala thr 


pro 


arg 


arg 


arg 


arg 


arg 


arg 


arg 


arg arg 


cys 


cys 


arg 


ser 


arg 


arg 


arg 


arg 


123/41 
















153/51 


















AGA TGA 


CCA 


GCG 


CGG 


GTG 


GGC 


TCC 


TCG 


AGC 


AGG CCG 


CCG 


CGG 


TCG 


AGG 


AGG 


CCT 


CCG 


ACA 


arg OPA 


pro 


ala 


arg 


val 


gly 


ser 


ser 


ser 


arg pro 


pro 


arg 


ser 


arg 


arg 


pro 


pro 


thr 


183/61 














213/71 


















CCG CCG 


CGG 


CGA 


ACC 


AGT 


TGA 


TGA 


ACA 


ATG 


TGC CCC 


AGG 


CGC 


TGC 


AAC 


AGC 


TGG 


CCC 


AGC 


pro pro 


arg 


arg 


thr 


ser 


OPA 


OPA 


thr 


met 


cys pro 


arg 


arg 


cys 


asn 


ser 


trp 


pro 


ser 


243/81 














273/91 


















CCA CGC 


AGG 


GCA 


CCA 


CGC 


CTT 


CTT 


CCA 


AGC 


TGG GTG 


GCC 


TGT 


GGA 


AGA 


CGG 


TCT 


CGC 


CGC 


pro arg 


arg 


ala 


pro 


arg 


leu 


leu 


pro 


ser 


trp val 


ala 


cys 


gly 


arg 


arg 


ser 


arg 


arg 



303/101 

ATC GGT CGC CGA TC 
ile gly arg arg 



SEQ ID No.BC 
FIGURE 5C 



part of the nucleotide sequence Seq 5A 



1/1 31/11 



CGC CGC 


GGC 


GAC 


GGC 


GAC 


GGC 


GAC 


GGC 


GAC 


GTT GCT 


GCC 


GTT 


CGA 


GGA 


GGC 


GCC 


GGA 


GAT 


arg arg 


gly 


asp 


gly 


asp 


gly 


asp 


gly 


asp 


val ala 


ala 


val 


arg 


gly 


gly 


ala 


gly 


asp 


61/21 


















91/31 


















GAC CAG 


CGC 


GGG 


TGG 


GCT 


CCT 


CGA 


GCA 


GGC 


CGC CGC 


GGT 


CGA 


GGA 


GGC 


CTC 


CGA 


CAC 


CGC 


asp gin 


arg 


gly 


trp 


ala 


pro 


arg 


ala 


gly 


arg arg 


gly 


arg 


gly 


gly 


leu 


arg 


his 


arg 


121/41 














151/51 


















CGC GGC 


GAA 


CCA 


GTT 


GAT 


GAA 


CAA 


TGT 


GCC 


CCA GGC 


GCT 


GCA 


ACA 


GCT 


GGC 


CCA 


GCC 


CAC 


arg gly 


glu 


pro 


val 


asp 


glu 


gin 


cys 


ala 


pro gly 


ala 


ala 


thr 


ala 


gly 


pro 


ala 


his 


181/61 
















271/71 


















GCA GGG 


CAC 


CAC 


GCC 


TTC 


TTC 


CAA 


GCT 


GGG 


TGG CCT 


GTG 


GAA 


GAC 


GGT 


CTC 


GCC 


GCA 


TCG 


ala gly 


his 


his 


ala 


phe 


phe 


gin 


ala 


gly 


trp pro 


val 


glu 


asp 


gly 


leu 


ala 


ala 


ser 



241/81 

GTC GCC GAT C 
val ala asp 



SEQ ID No-5A f 
FIGURE 5A f 



REPLACEMENT SHEET (RULE 26) 




15/185 



1/1 31/11 

TAC GCC GCG GCG ACG GCG ACG GCG ACG GCG ACG TTG CTG CCG TTC GAG GAG GCG CCG GAG 
tyr ala ala ala thr ala thr ala thr ala thr leu leu pro phe glu glu ala pro glu 
61/21 91/31 

ATG ACC AGC GCG GGT GGG CTC CTC GAG CAG GCC GCC GCG GTC GAG GAG GCC TCC GAC ACC 
met thr ser ala gly gly leu leu glu gin ala ala ala val glu glu ala ser asp thr 
121/41 151/51 

GCC GCG GCG AAC CAG TTG ATG AAC AAT GTG CCC CAG GCG CTG CAA CAG CTG GCC CAG CCC 
ala ala ala asn gin leu met asn asn val pro gin ala leu gin gin leu ala gin pro 
181/61 211/71 

ACG CAG GGC ACC ACG CCT TCT TCC AAG CTG GGT GGC CTG TGG AAG ACG GTC TCG CCG CAT 
thr gin gly thr thr pro ser ser lys leu gly gly leu trp lys thr val ser pro his 
241/81 

CGG TCG CCG ATC 
arg ser pro ile 



SEQ ID No.5B f 
FIGURE 5B' 



1/1 31/11 

ACG CCG CGG CGA CGG CGA CGG CGA CGG CGA CGT TGC TGC CGT TCG AGG AGG CGC CGG AGA 
thr pro arg arg arg arg arg arg arg arg arg cys cys arg ser arg arg arg arg arg 
61/21 91/31 

TGA CCA GCG CGG GTG GGC TCC TCG AGC AGG CCG CCG CGG TCG AGG AGG CCT CCG ACA CCG 
OPA pro ala arg val gly ser ser ser arg pro pro arg ser arg arg pro pro thr pro 
121/41 151/51 

CCG CGG CGA ACC AGT TGA TGA ACA ATG TGC CCC AGG CGC TGC AAC AGC TGG CCC AGC CCA 
pro arg arg thr ser OPA OPA thr met cys pro arg arg cys asn ser trp pro ser pro 
181/61 211/71 

CGC AGG GCA CCA CGC CTT CTT CCA AGC TGG GTG GCC TGT GGA AGA CGG TCT CGC CGC ATC 
arg arg ala pro arg leu leu pro ser trp val ala cys gly arg arg ser arg arg ile 
241/81 

GGT CGC CGA TC 
gly arg arg 



SEQ ID No.5C f 
FIGURE 5C f 



REPLACEMENT SHEET (RULE 26) 



m 



16/185 



ORF predicted by Cole et al . (Nature 393:537-544) and containing seq5A' 
1/1 31/11 

tga act gat gat tct gat age gac caa cct ctt ggg gca aaa cac ccc ggc gat cgc ggt 
OPA thr asp asp ser asp ser asp gin pro leu gly ala lys his pro gly asp arg gly 
61/21 91/31 

caa cga ggc cga ata egg cga gat gtg ggc cca aga cgc cgc cgc gat gtt tgg eta cgc 
gin arg gly arg ile arg arg asp val gly pro arg arg arg arg asp val trp leu arg 
121/41 151/51 

cgc ggc gac ggc gac ggc gac ggc gac gtt get gec gtt cga gga ggc gec gga gat gac 
arg gly asp gly asp gly asp gly asp val ala ala val arg gly gly ala gly asp asp 
181/61 211/71 

cag cgc ggg tgg get cct cga gca ggc cgc cgc ggt cga gga ggc etc cga cac cgc cgc 
gin arg gly trp ala pro arg ala gly arg arg gly arg gly gly leu arg his arg arg 
241/81 271/91 

ggc gaa cca gtt gat gaa caa tgt gec cca ggc get gca aca get ggc cca gee cac gca 
gly glu pro val asp glu gin cys ala pro gly ala ala thr ala gly pro ala his ala 
301/101 331/111 

ggg cac cac gee ttc ttc caa get ggg tgg cct gtg gaa gac ggt etc gee gca teg gtc 
gly his his ala phe phe gin ala gly trp pro val glu asp gly leu ala ala ser val 
361/121 391/131 

gec gat cag caa cat ggt gtc gat ggc caa caa cca cat gtc gat gac caa etc ggg tgt 
ala asp gin gin his gly val asp gly gin gin pro his val asp asp gin leu gly cys 
421/141 451/151 

gtc gat gac caa cac ctt gag etc gat gtt gaa ggg ctt tgc tec ggc ggc ggc cgc cca 
val asp asp gin his leu glu leu asp val glu gly leu cys ser gly gly gly arg pro 
481/161 511/171 

ggc cgt gca aac cgc ggc gca aaa egg ggt ccg ggc gat gag etc get ggg cag etc get 
gly arg ala asn arg gly ala lys arg gly pro gly asp glu leu ala gly gin leu ala 
541/181 571/191 

ggg ttc ttc ggg tct ggg egg tgg ggt ggc cgc caa ctt ggg teg ggc ggc etc ggt egg 
gly phe phe gly ser gly arg trp gly gly arg gin leu gly ser gly gly leu gly arg 
601/201 631/211 

ttc gtt gtc ggt gee gca ggc ctg ggc cgc ggc caa cca ggc agt cac ccc ggc ggc gcg 
phe val val gly ala ala gly leu gly arg gly gin pro gly ser his pro gly gly ala 
661/221 691/231 

ggc get gec get gac cag cct gac cag cgc cgc gga aag agg gee egg gca gat get ggg 
gly ala ala ala asp gin pro asp gin arg arg gly lys arg ala arg ala asp ala gly 
721/241 751/251 

egg get gec ggt ggg gca gat ggg cgc cag ggc egg tgg tgg get cag tgg tgt get gcg 
arg ala ala gly gly ala asp gly arg gin gly arg trp trp ala gin trp cys ala ala 
781/261 811/271 

tgt tec gee gcg ace eta tgt. gat gec gca ttc tec ggc ggc egg eta gga gag ggg gcg 
cys ser ala ala thr leu cys asp ala ala phe ser gly gly arg leu gly glu gly ala 
841/281 

cag act gtc gtt att tga 
gin thr val val ile OPA 



SEQ ID No.SF 
FIGURE 5F 



REPLACEMENT SHEET (RULE 26) 



# • 



17/185 

sequence Rvll96 predicted by Cole et al . (Nature 393:537-544) and capable of encoding an 
ORF fused with Seq5A' 

1/1 31/11 

atg gtg gat ttc ggg gcg tta cca ccg gag ate aac tecgeg agg atg tac gec ggc ccg 
Met val asp phe gly ala leu pro pro glu ile asn ser ala arg met tyr ala gly pro 
61/21 91/31 

ggt teg gec teg ctg gtg gee gcg get cag atg tgg gac age gtg gcg agt gac ctg ttt 
gly ser ala ser leu val ala ala ala gin met trp asp ser val ala ser asp leu phe 
121/41 151/51 

teg gee gcg teg gcg ttt cag teg gtg gtc tgg ggt ctg acg gtg ggg teg tgg ata ggt 
ser ala ala ser ala phe gin ser val val trp gly leu thr val gly ser trp ile gly 
181/61 211/71 

teg teg gcg ggt ctg atg gtg gcg gcg gee teg ccg tat gtg gcg tgg atg age gtc acc 
ser ser ala gly leu met val ala 

ala ala ser pro tyr val ala trp met ser val thr 
241/81 271/91 

gcg ggg cag gec gag ctg acc gee gec cag gtc egg gtt get gcg gcg gee tac gag acg 
ala gly gin ala glu leu thr ala ala gin val arg val ala ala ala ala tyr glu thr 
301/101 331/111 

gcg tat ggg ctg acg gtg ccc ccg ccg gtg ate gec gag aac cgt get gaa ctg atg att 
ala tyr gly leu thr val pro pro pro val ile ala glu asn arg ala glu leu met ile 
361/121 391/131 

ctg ata gcg acc aac etc ttg ggg caa aac acc ccg gcg ate gcg gtc aac gag gee gaa 
leu ile ala thr asn leu leu gly gin asn thr pro ala ile ala val asn glu ala glu 
421/141 451/151 

tac ggc gag atg tgg gee caa gac gec gee gcg atg ttt ggc tac gee gcg gcg acg gcg 
tyr gly glu met trp ala gin asp ala ala ala met phe gly tyr ala ala ala thr ala 
481/161 511/171 

acg gcg acg gcg acg ttg ctg ccg ttc gag gag gcg ccg gag atg acc age gcg ggt ggg 
thr ala thr ala thr leu leu pro phe glu glu ala pro glu met thr ser ala gly gly 
541/181 571/191 

etc etc gag cag gee gee gcg gtc gag gag gee tec gac acc gee gcg gcg aac cag ttg 
leu leu glu gin ala ala ala val glu glu ala ser asp thr ala ala. ala asn gin leu 
601/201 631/211 

atg aac aat gtg ccc cag gcg ctg caa cag ctg gee cag ccc acg cag ggc acc acg cct 
met asn asn val pro gin ala leu gin gin leu ala gin pro thr gin gly thr thr pro 
661/221 691/231 

tct tec aag ctg ggt ggc ctg tgg aag acg gtc teg ccg cat egg teg ccg ate age aac 
ser ser lys leu gly gly leu trp lys thr val ser pro his arg ser pro ile ser asn 
721/241 751/251 

atg gtg teg atg gee aac aac cac atg teg atg acc aac teg ggt gtg teg atg acc aac 
met val ser met ala asn asn his met ser met thr asn ser gly val ser met thr asn 
781/261 811/271 

acc ttg age teg atg ttg aag ggc ttt get ccg gcg gcg gee gee cag gee gtg caa acc 
thr leu ser ser met leu lys gly phe ala pro ala ala ala ala gin ala val gin thr 
841/281 871/291 

gcg gcg caa aac ggg gtc egg gcg atg age teg ctg ggc age teg ctg ggt tct teg ggt 
ala ala gin asn gly val arg ala met ser ser leu gly ser ser leu gly ser ser gly 
901/301 931/311 

ctg ggc ggt ggg gtg gee gee aac ttg ggt egg gcg gec teg gtc ggt teg ttg teg gtg 
leu gly gly gly val ala ala asn leu gly arg ala ala ser val gly ser leu ser val 
961/321 991/331 

ccg cag gee tgg gee gcg gee aac cag gca gtc acc ccg gcg gcg egg gcg ctg ccg ctg 
pro gin ala trp ala ala ala asn gin ala val thr pro ala ala arg ala leu pro leu 
1021/341 1051/351 

acc age ctg acc age gee gcg gaa aga ggg ccc ggg cag atg ctg ggc ggg ctg ccg gtg 
thr ser leu thr ser ala ala glu arg gly pro gly gin met leu gly gly leu pro val 
1081/361 1111/371 

ggg cag atg ggc gee agg gec ggt ggt ggg etc agt ggt gtg ctg cgt gtt ccg ccg cga 
gly gin met gly ala arg ala gly gly gly leu ser gly val leu arg val pro pro arg 
1141/381 1171/391 
ccc tat gtg atg ccg cat tct ccg gcg gee ggc tag 
pro tyr val met pro his ser pro ala ala gly AMB 

SEQ ID No.5R 



FIGURE 5R 



REPLACEMENT SHEET (RULE 26) 




V 



18/185 

Seq 5P: ORF according to Cole et al. (Nature 393:537-544) and containing the sequence 
Rvll96 

1/1 31/11 

tag gga cac gta atg gtg gat ttc ggg gcg tta cca ccg gag ate aac tec gcg agg atg 
AMB gly his val met val asp phe gly ala leu pro pro glu ile asn ser ala arg met 
61/21 91/31 

tac gec ggc ccg ggt teg gec teg ctg gtg gee gcg get cag atg tgg gac age gtg gcg 
tyr ala gly pro gly ser ala ser leu val ala ala ala gin met trp asp ser val ala 
121/41 151/51 

agt gac ctg ttt teg gec gcg teg gcg ttt cag teg gtg gtc tgg ggt ctg acg gtg ggg 
ser asp leu phe ser ala ala ser ala phe gin ser val val trp gly leu thr val gly 
181/61 211/71 

teg tgg ata ggt teg teg gcg ggt ctg atg gtg gcg gcg gee teg ccg tat gtg gcg tgg 
ser trp ile gly ser ser ala gly leu met val ala ala ala ser pro tyr val ala trp 
241/81 271/91 

atg age gtc acc gcg ggg cag gee gag ctg ace gec gee cag gtc egg gtt get gcg gcg 
met ser val thr ala gly gin ala glu leu thr ala ala gin val arg val ala ala ala 
301/101 331/111 

gec tac gag acg gcg tat ggg ctg acg gtg ccc ccg ccg gtg ate gec gag aac cgt get 
ala tyr glu thr ala tyr gly leu thr val pro pro pro val ile ala glu asn arg ala 
361/121 391/131 

gaa ctg atg att ctg ata gcg acc aac etc ttg ggg caa aac acc ccg gcg ate gcg gtc 
glu leu met ile leu ile ala thr asn leu leu gly gin asn thr pro ala ile ala val 
421/141 451/151 

aac gag gee gaa tac ggc gag atg tgg gee caa gac gee gee gcg atg ttt ggc tac gee 
asn glu ala glu tyr gly glu met trp ala gin asp ala ala ala met phe gly tyr ala 
481/161 511/171 

gcg gcg acg gcg acg gcg acg gcg acg ttg ctg ccg ttc gag gag gcg ccg gag atg acc 
ala ala thr ala thr ala thr ala thr leu leu pro phe glu glu ala pro glu met thr 
541/181 571/191 

age gcg ggt ggg etc etc gag cag gec gec gcg gtc gag gag gee tec gac acc gee gcg 
ser ala gly gly leu leu glu gin ala ala ala val glu glu ala ser asp thr ala ala 
601/201 631/211 

gcg aac cag ttg atg aac aat gtg ccc cag gcg ctg caa cag ctg gee cag ccc acg cag 
ala asn gin leu met asn asn val pro gin ala leu gin gin leu ala gin pro thr gin 
661/221 691/231 

ggc acc acg cct tct tec aag ctg ggt ggc ctg tgg aag acg gtc teg ccg cat egg teg 
gly thr thr pro ser ser lys leu gly gly leu trp lys thr val ser pro his arg ser 
721/241 751/251 

ccg ate age aac atg gtg teg atg gec aac aac cac atg teg atg acc aac teg ggt gtg 
pro ile ser asn met val ser met ala asn asn his met ser met thr asn ser gly val 
781/261 811/271 

teg atg acc aac acc ttg age teg atg ttg aag ggc ttt get ccg gcg gcg gee gee cag 
ser met thr asn thr leu ser ser met leu lys gly phe ala pro ala ala ala ala gin 
841/281 871/291 

gee gtg caa acc gcg gcg caa aac ggg gtc egg gcg atg age teg ctg ggc age teg ctg 
ala val gin thr ala ala gin asn gly val arg ala met ser ser leu gly ser ser leu 
901/301 931/311 

ggt tct teg ggt ctg ggc ggt ggg gtg gee gec aac ttg ggt egg gcg gee teg gtc ggt 
gly ser ser gly leu gly gly gly val ala ala asn leu gly arg ala ala ser val gly 
961/321 991/331 

teg ttg teg gtg ccg cag gee tgg gee gcg gee aac cag gca gtc acc ccg gcg gcg egg 
ser leu ser val pro gin ala trp ala ala ala asn gin ala val thr pro ala ala arg 
1021/341 1051/351 

gcg ctg ccg ctg acc age ctg acc age gec gcg gaa aga ggg ccc ggg cag atg ctg ggc 
ala leu pro leu thr ser leu thr ser ala ala glu arg gly pro gly gin met leu gly 
1081/361 1111/371 

ggg ctg ccg gtg ggg cag atg ggc gee agg gee ggt ggt ggg etc agt ggt gtg ctg cgt 

gly leu pro val gly gin met gly ala arg ala gly gly gly leu ser gly val leu arg 

1141/381 1171/391 

gtt ccg ccg cga ccc tat gtg atg ccg cat tct ccg gcg gee ggc tag 

val pro pro arg pro tyr val met pro his ser pro ala ala gly AMB 

SEQ ID No.5P 



FIGURE 5P 
REPLACEMENT SHEET (TttJLE 26) 




GGA TCC TGA TGC AAG TGG TCC 
gly ser OPA cys lys trp ser 
61/21 

TTG GTG CAT CCG GGC TGC GAG 
leu val his pro gly cys glu 
121/41 

CTT GCC CAC TGT TCC TCC CTG 
leu ala his cys ser ser leu 
181/61 

GCG GTG CCT GGG CAT CCA GCA 
ala val pro gly his pro ala 
241/81 

TCA CCA ACG TCG AAA CCG ACG 
ser pro thr ser lys pro thr 
301/101 

GCT GTA CCC TCA CCG AAC GTG 
ala val pro ser pro asn val 
361/121 

GCA GCC GAG CTC AAG GTG TCC 
ala ala glu leu lys val ser 
421/141 

GAA GGA GAA GAT C 
glu gly glu asp 



GAT CCT GAT GCA AGT GGT CCG 
asp pro asp ala ser gly pro 
62/21 

TGG TGC ATC CGG GCT GCG AGC 
trp cys ile arg ala ala ser 
122/41 

TTG CCC ACT GTT CCT CCC TGC 
leu pro thr val pro pro cys 
182/61 

CGG TGC CTG GGC ATC CAG CAA 
arg cys leu gly ile gin gin 
242/81 

CAC CAA CGT CGA AAC CGA CGG 
his gin arg arg asn arg arg 
302/101 

CTG TAC CCT CAC CGA ACG TGA 
leu tyr pro his arg thr OPA 
362/121 

CAG CCG AGC TCA AGG TGT CCC 
gin pro ser ser arg cys pro 
422/141 

AAG GAG AAG ATC 
lys glu lys ile 



19/185 









31/11 




GGG 


ATT 


TGT 


CGG CAG 


CCA 


gly 


ile 


cys 


arg glh 


pro 








91/31 




CAT 


GCA 


CGC 


ACC GAC 


CAG 


his 


ala 


arg 


thr asp 


gin 








151/51 




CCG 


GCA 


CCA 


TGT GCG 


ACA 


pro 


ala 


pro 


cys ala 


thr 








211/71 




AAA 


CGG 


GGA 


GCT CAA 


GAA 


lys 


arg 


gly 


ala gin 


glu 








271/91 




GTT 


GCC 


AGC 


CGG CCC 


ACG 


val 


ala 


ser 


arg pro 


thr 








331/111 




AGT 


CCC 


ACA 


CCG CGG 


AGG 


ser 


pro 


thr 


pro arg 


arg 








391/131 




CGC 


ACC 


ACT 


GTC TCG 


AAT 


arg 


thr 


thr 


val ser 


asn 



SEQ ID N0.6A 
FIGURE 6A 









32/11 




GGA 


TTT 


GTC 


GGC AGC 


CAC 


gly 


phe 


val 


gly ser 


his 








92/31 




ATG 


CAC 


GCA 


CCG ACC 


AGC 


met 


his 


ala 


pro thr 


ser 








152/51 




CGG 


CAC 


CAT 


GTG CGA 


CAA 


arg 


his 


his 


val arg 


gin 








212/71 




AAC 


GGG 


GAG 


CTC AAG 


AAC 


asn 


gly 


glu 


leu lys 


asn 








272/91 




TTG 


CCA 


GCC 


GGC CCA 


CGA 


leu 


pro 


ala 


gly pro 


arg 








332/111 




GTC 


CCA 


CAC 


CGC GGA 


GGC 


val 


pro 


his 


arg gly 


gly 








392/131 




GCA 


CCA 


CTG 


TCT CGA 


ATG 


ala 


pro 


leu 


ser arg 


met 



SEQ ID N0.6B 
FIGURE 6B 




CCG CGG TCC CGT CGA CCA ACG 
arg arg ser arg arg pro thr 

CGC GGC GAG CGC GGC TAG CTG 
arg gly glu arg gly AMB leu 

AGC TTA AGC GCA GCA GTA CCG 
ser leu ser ala ala val pro 

CGA TTC ATG AAC GAG GGG TCG 
arg phe met asn glu gly sec 

ATA TTG CGT GCT CGA GGG TCC 
ile leu arg ala arg gly ser 

CGG GCG ACT CTG GCG TCG TTA 
arg ala thr leu ala ser leu 

GCT TTT AAC CGA CCG GAT CCA 
ala phe asn arg pro asp pro 



GGC GGT CCC GTC GAC CAA CGT 
gly gly pro val asp gin arg 

GCG GCG AGC GCG GCT AGC TGC 
ala ala ser ala ala ser cys 

GCT TAA GCG CAG CAG TAC CGG 
ala OCH ala gin gin tyr arg 

GAT TCA TGA ACG AGG GGT CGT 
asp ser OPA thr arg gly arg 

TAT TGC GTG CTC GAG GGT CCG 
tyr cys val leu glu gly pro 

GGG CGA CTC TGG CGT CGT TAG 
gly arg leu trp arg arg AMB 

CTT TTA ACC GAC CGG ATC CAG 
leu leu thr asp arg ile gin 



REPLACEMENT SHEET (RULE 26) 



20/185 



33/11 

ATC CTG ATG CAA GTG GTC CGG GAT TTG TCG GCA GCC ACG GCG GTC CCG TCG ACC AAC GTT 
ile leu met gin val val arg asp leu ser ala ala thr ala val pro ser thr asn val 
63/21 93/31 

GGT GCA TCC GGG -CTG . CGA . GCA TGC ACG CAC CGA CCA GCG CGG CGA GCG CGG CTA GCT GCT 
gly ala ser gly leu arg ala cys thr his arg pro ala arg arg ala arg leu ala ala 
123/41 153/51 

TGC CCA CTG TTC CTC CCT GCC GGC ACC ATG TGC GAC AAG CTT AAG CGC AGC AGT ACC GGC 
cys pro leu phe leu pro ala gly thr met cys asp lys leu lys arg scr ser thr gly 
183/61 213/71 

GGT GCC TGG GCA TCC AGC AAA ACG GGG AGC TCA AGA ACG ATT CAT GAA CGA GGG GTC GTC 
gly ala trp ala ser ser lys thr gly ser ser arg thr ile his glu arg gly val val 
243/81 273/91 

ACC AAC GTC GAA ACC GAC GGT TGC CAG CCG GCC CAC GAT ATT GCG TGC TCG AGG GTC CGC 
thr asn val glu thr asp gly cys gin pro ala his asp ile ala cys ser arg val arg 
303/101 333/111 

TGT ACC CTC ACC GAA CGT GAG TCC CAC ACC GCG GAG GCG GGC GAC TCT GGC GTC GTT AGC 
cys thr leu thr glu arg glu ser his thr ala glu ala gly asp ser gly val val ser 
363/121 393/131 

AGC CGA GCT CAA GGT GTC CCG CAC CAC TGT CTC GAA TGC TTT TAA CCG ACC GGA TCC AGA 
ser arg ala gin gly val pro his his cys leu glu cys phe OCH pro thr gly ser arg 
423/141 

AGG AGA AGA TC 
arg arg arg 



SEQ ID N0.6C 
FIGURE 6C 



CCG TCG GCA ACT TGG 
pro ser ala thr trp 
61/21 

AGC GGC TCC ATC GGC 
ser gly ser ile gly 
121/41 

TGG CCG GTG ATA TAT 
trp pro val ile tyr 
181/61 

CGC CGC GGT GCT CGA 
arg arg gly ala arg 
241/81 

CTT GTA TCT CTT CTC 
leu val ser leu leu 
301/101 

AGG TTC ACC AGG TCT 
arg phe thr arg ser 



CCG 


CTG 


AGG 


TCG 


GCT 


pro 


leu 


arg 


ser 


ala 


TTT 


GCT 


GGT 


AGC 


GGT 


phe 


ala 


gly 


ser 


gly 


TCG 


GTC 


AGA 


CGG 


GTA 


trp 


val 


arg 


arg 


val 


GCC 


AGG 


CTT 


ACG 


ACC 


ala 


arg 


leu 


thr 


thr 


CGT 


GCC 


ACC 


CCC 


TAG 


arg 


ala 


thr 


pro 


AMB 


CAC 


CAG 


ATC 






his 


gin 


ile 







31/11 

TGA TCC CTG GGC CGA 
OPA ser leu gly arg 
91/31 

TCG GCG GGA AGC TAG 
ser ala gly ser AMB 
151/51 

TGG CGG CGG CTG AGG 
trp arg arg leu arg 
211/71 

AGG GAA TTT CGA AAA 
arg glu phe arg lys 
271/91 

GTG TAG TGT TTT CGA 
val AMB cys phe arg 



GGC GGG TCA GCC AAT 
gly gly ser ala asn 

CGG CGA CGT TGT CGG 
arg arg arg cys arg 

TGA TCT GCG ACA CGC 
OPA ser ala thr arg 

TGT TAT TCA GAA CAT 
cys tyr ser glu his 

GTA CCG GCA GAT CCC 
val pro ala asp pro 



SEQ ID No.7A 
FIGURE 7A 



REPLACEMENT SHEET (RULE 26) 



9 



21/185 





















O O / 1 1 

32/11 


















CGT CGG 


CAA 


CTT 


GGC 


CGC 


TGA 


GGT 


CGG 


CTT 


GAT CCC 


TGG 


GCC 


GAG 


GCG 


GGT 


CAG 


CCA 


ATA 


arg arg 


gin 


leu 


gly 


arg 


OPA 


gly 


arg 


leu 


asp pro 


trp 


ala 


glu 


ala 


gly 


gin 


pro 


lie 


62/21 


































GCG GCT 


CCA 


TCG 


GCT 


TTG 


CTG 


GTA 




/-« rn rp 




ran 






rzczc 




ul 1 






ala ala 


pro 


ser 


ala 


leu 


leu 


val 


ala 


,,-1 

VH1 


arg arg 


glu 


aia 


s e r 


gly 


asp 


Val 


Vet J. 


gly 


TOO / A 1 
















152/51 


















GGC CGG 


TGA 


TAT 


ATT 


GGG 


TCA 


GAC 


GGG 


TAT 


GGC GGC 


GGC 


TGA 


GGT 


GAT 


CTG 


CGA 


CAC 


GCC 


gly arg 


OPA 


tyr 


ile 


gly 


ser 


asp 


gly 


tyr 


gly gly 


gly 


OPA 


gly 


asp 


leu 


arg 


his 


ala 


182/61 
















212/71 


















GCC GCG 


GTG 


CTC 


GAG 


CCA 


GGC 


TTA 


CGA 


CCA 


GGG AAT 


TTC 


GAA 


AAT 


GTT 


ATT 


CAG 


AAC 


ATC 


ala ala 


val 


leu 


glu 


pro 


gly 


leu 


arg 


pro 


gly asn 


phe 


glu 


asn 


val 


ile 


gin 


asn 


ile 


242/81 


















272/91 


















TTG TAT 


CTC 


TTC 


TCC 


GTG 


CCA 


CCC 


CCT 


AGG 


TGT AGT 


GTT 


TTC 


GAG 


TAC 


CGG 


CAG 


ATC 


CCA 


leu tyr 


leu 


phe 


ser 


val 


pro 


pro 


pro 


arg 


cys ser 


val 


phe 


glu 


tyr 


arg 


gin 


ile 


pro 


302/101 




































GGT TCA 


CCA 


GGT 


CTC 


ACC 


AGA 


TC 
























gly ser 


pro 


gly 


leu 


thr 


arg 



























SEQ ID No.7B 
FIGURE 7B 



33/11 



GTC GGC 


AAC 


TTG 


GCC 


GCT 


GAG 


GTC 


GGC 


TTG 


ATC CCT 


GGG 


CCG 


AGG 


CGG 


GTC 


AGC 


CAA 


TAG 


val gly 


asn 


leu 


ala 


ala 


glu 


val 


gly 


leu 


ile pro 


gly 


pro 


arq 


arg 


val 


ser 


gin 


AMB 


63/21 


















93/31 


















CGG CTC 


CAT 


CGG 


CTT 


TGC 


TGG 


TAG 


CGG 


TTC 


GGC GGG 


AAG 


CTA 


GCG 


GCG 


ACG 


TTG 


TCG 


GTG 


arg leu 


his 


arg 


leu 


cys 


trp 


AMB 


arg 


phe 


gly gly 


lys 


leu 


ala 


ala 


thr 


leu 


ser 


val 


123/41 


















153/51 


















GCC GGT 


GAT 


ATA 


TTG 


GGT 


CAG 


ACG 


GGT 


ATG 


GCG GCG 


GCT 


GAG 


GTG 


ATC 


TGC 


GAC 


ACG 


CCG 


ala gly 


asp 


ile 


leu 


gly 


gin 


thr 


gly 


met 


ala ala 


ala 


glu 


val 


ile 


cys 


asp 


thr 


pro 


183/61 
















213/71 


















CCG CGG 


TGC 


TCG 


AGC 


CAG 


GCT 


TAC 


GAC 


CAG 


GGA ATT 


TCG 


AAA 


ATG 


TTA 


TTC 


AGA 


ACA 


TCT 


pro arg 


cys 


ser 


ser 


gin 


ala 


tyr 


asp 


gin 


gly ile 


ser 


lys 


met 


leu 


phe 


arg 


thr 


ser 


243/81 
















273/91 


















TGT ATC 


TCT 


TCT 


CCG 


TGC 


CAC 


CCC 


CTA 


GGT 


GTA GTG 


TTT 


TCG 


AGT 


ACC 


GGC 


AGA 


TCC 


CAG 


cys ile 


ser 


ser 


pro 


cys 


his 


pro 


leu 


gly 


val val 


phe 


ser 


ser 


thr 


gly 


arg 


ser 


gin 



303/101 

GTT CAC CAG GTC TCA CCA GAT C 
val his gin val ser pro asp 

SEQ ID No.7C 
FIGURE 7C 



REPLACEMENT SHEET (RULE 26) 




22/185 





















O T /IT 


















CTT TGC 


GTG 


ATG 


TCC 


AAT 


GGC 


GAA 


AAC 


GAC 


GCC TTG 


TCA 


TCG 


CAA 


TCG 


TCA 


GCA 


CCG 


GCC 


leu cys 


val 


met 


ser 


asn 


gly 


glu 


asn 


asp 


ala leu 


ser 


ser 


gin 


ser 


ser 


ala 


pro 


ala 


61/21 


















91/31 


















TAG TTT 


TCG 


CGA 


TGA 


CGC 


TCG 


TTC 


TGA 


CCG 


GAC TTG 


TGA 


ACG 


GGT 


TTC 


GGG 


TCG 


AGG 


CCG 


AMB phe 


ser 


arg 


OPA 


arg 


ser 


phe 


OPA 


pro 


asp leu 


OPA 


tnr 


gly 


_ i_ _ 
pne 


gly 


ser 


arg 


pro 


121/41 


















TCI /C1 

lbl/Dl 


















AGC GAA 


CCG 


TCG 


ATT 


CCA 


TGG 


GTG 


TCG 


ACG 


CAT TCG 


TGG 


TCA 




L-LCj 






LAb 




ser glu 


pro 


ser 


ile 


pro 


trp 


val 


ser 


thr 


his ser 


trp 


ser 


arg 


pro 


aia 


arg 


gin 


asp 


181/61 
















Z 11/ 11 


















CGT TCC 


TGG 


GTT 


CGA 


CAC 


CAT 


TCG 


CCC 


AAA 


1 CG ALL 


1 OL, 


LLL 


Abb 


lib 




(jib 




^_ 1 b 


arg ser 


trp 


Val 


arg 




hi <; 


ser 


p r o 




ser thr 


cys 


pro 


arg 


leu 


leu 


val 


ara 


leu 


241/81 














271/91 


















GCG TCT 


TGG 


CTG 


CCG 


ccc 


CAC 


TAG 


CGA 


CTG 


CGC CGT 


CGA 


CGA 


TCC 


GGC 


AGG 


GCA 


CGT 


CAG 


ala ser 


trp 


leu 


pro 


pro 


his 


AMB 


arg 


leu 


arg arg 


arg 


arg 


ser 


gly 


arg 


ala 


arg 


gin 


301/101 
















331/111 


















CGC GAA 


ACG 


TCA 


CCG 


CGT 


TCG 


GGG 


CAC 


CAG 


AGC ACG 


GAC 


CCG 


GCA 


TGC 


CGC 


GGG 


TCT 


CGG 


arg glu 


thr 


ser 


pro 


arg 


ser 


gly 


his 


gin 


ser thr 


asp 


pro 


ala 


cys 


arg 


gly 


ser 


arg 


361/121 


















391/131 


















ACG GTC 


GGG 


CGC 


CAT 


CGA 


CGC 


CGG 


ACG 


AGG 


TCG CGG 


TGT 


CGA 


GCA 


CGC 


TGG 


GCC 


GAA 


ACC 


thr val 


gly 


arg 


his 


arg 


arg 


arg 


thr 


arg 


ser arg 


cys 


arg 


ala 


arg 


trp 


ala 


glu 


thr 



421/141 

TCG GCG ACG ATC 
ser ala thr ile 

SEQ ID N0.8A 
FIGURE 8A 



32/11 



TTT GCG 


TGA 


TGT 


CCA 


ATG 


GCG 


AAA 


ACG 


ACG 


CCT TGT 


CAT 


CGC 


AAT 


CGT 


CAG 


CAC 


CGG 


CCT 


phe ala 


OPA 


cys 


pro 


met 


ala 


lys 


thr 


thr 


pro cys 


his 


arg 


asn 


arg 


gin 


his 


arg 


pro 


62/21 


















92/31 


















AGT TTT 


CGC 


GAT 


GAC 


GCT 


CGT 


TCT 


GAC 


CGG 


ACT TGT 


GAA 


CGG 


GTT 


TCG 


GGT 


CGA 


GGC 


CGA 


ser phe 


arg 


asp 


asp 


ala 


arg 


ser 


asp 


arg 


thr cys 


glu 


arg 


val 


ser 


gly 


arg 


gly 


arg 


122/41 


















152/51 


















GCG AAC 


CGT 


CGA 


TTC 


CAT 


GGG 


TGT 


CGA 


CGC 


ATT CGT 


GGT 


CAA 


GGC 


CGG 


CGC 


GGC 


AGG 


ACC 


ala asn 


arg 


arg 


phe 


his 


gly 


cys 


arg 


arg 


ile arg 


gly 


gin 


gly 


arg 


arg 


gly 


arg 


thr 


182/61 


















212/71 


















GTT CCT 


GGG 


TTC 


GAC 


ACC 


ATT 


CGC 


CCA 


AAT 


CGA CCT 


GCC 


CCA 


GGT 


TGC 


TCG 


TGC 


GCC 


TGG 


val pro 


gly 


phe 


asp 


thr 


ile 


arg 


pro 


asn 


arg pro 


ala 


pro 


gly 


cys 


ser 


cys 


ala 


trp 


242/81 
















272/91 


















CGT CTT 


GGC 


TGC 


CGC 


CCC 


ACT 


AGC 


GAC 


TGC 


GCC GTC 


GAC 


GAT 


CCG 


GCA 


GGG 


CAC 


GTC 


AGC 


arg leu 


gly 


cys 


arg 


pro 


thr 


ser 


asp 


cys 


ala val 


asp 


asp 


pro 


ala 


gly 


his 


val 


ser 


302/101 


















332/111 


















GCG AAA 


CGT 


CAC 


CGC 


GTT 


CGG 


GGC 


ACC 


AGA 


GCA CGG 


ACC 


CGG 


CAT 


GCC 


GCG 


GGT 


CTC 


GGA 


ala lys 


arg 


his 


arg 


val 


arg 


gly 


thr 


arg 


ala arg 


thr 


arg 


his 


ala 


ala 


gly 


leu 


gly 


362/121 
















392/131 


















CGG TCG 


GGC 


GCC 


ATC 


GAC 


GCC 


GGA 


CGA 


GGT 


CGC GGT 


GTC 


GAG 


CAC 


GCT 


GGG 


CCG 


AAA 


CCT 


arg ser 


gly 


ala 


ile 


asp 


ala 


gly 


arg 


gly 


arg gly 


val 


glu 


his 


ala 


gly 


pro 


lys 


pro 



422/141 

CGG CGA CGA TC 
arg arg arg 



SEQ ID N0.8B 



FIGURE 8B 
REPLACEMENT SHEET (RULE 26) 




23/185 





















33/11 


















TTG CGT 


GAT 


GTC 


CAA 


TGG 


CGA 


AAA 


CGA 


CGC 


CTT GTC 


ATC 


GCA 


ATC 


GTC 


AGC 


ACC 


GGC 


CTA 


leu arg 


asp 


val 


gin 


trp 


arg 


lys 


arg 


arg 


leu val 


ile 


vala 


ile 


val 


ser 


thr 


gly 


leu 


63/21 


















93/31 




\ 














GTT TTC 


GCG 


ATG 


ACG 


CTC 


GTT 


CTG 


ACC 


GGA 


CTT GTG 


AAC 


GGG 


TTT 


CGG 


GTC 


GAG 


GCC 


GAG 


val phe 


ala 


met 


thr 


leu 


val 


leu 


thr 


gly 


leu val 


asn 


gly 


phe 


arg 


val 


glu 


ala 


glu 


123/41 


















153/51 


















CGA ACC 


GTC 


GAT 


TCC 


ATG 


GGT 


GTC 


GAC 


GCA 


TTC GTG 


GTC 


AAG 


GCC 


GGC 


GCG 


GCA 


GGA 


CCG 


arg thr 


val 


asp 


ser 


met 


gly 


val 


asp 


ala 


phe val 


val 


lys 


ala 


gly 


ala 


ala 


gly 


pro 


183/61 


















213/71 


















TTC CTG 


GGT 


TCG 


ACA 


CCA 


TTC 


GCC 


f~* A A 










oil 


1 


\*>\j 1 


fieri 


^.v^ J. 




phe leu 


gly 


ser 


thr 


pro 


pne 


ala 


gin 


lie 


asp leu 


pro 


gin 


val 


d. -L Cl 


arg 


old 


pro 


gly 


243/81 
















O *7 ^ / Q 1 


















Gl O lib 


1 








<^ 1 T\. 




ACT 


GCG 


CCG TCG 


ACG 


ATC 


CGG 


CAG 


GGC 


ACG 


TCA 


GCG 


val leu 


ala 


ala 


ala 


pro 


leu 


ala 


thr 


ala 


pro ser 


thr 


ile 


arg 


gin 


gly 


thr 


ser 


ala 


303/101 


















333/111 


















CGA AAC 


GTC 


ACC 


GCG 


TTC 


GGG 


GCA 


CCA 


GAG 


CAC GGA 


CCC 


GGC 


ATG 


CCG 


CGG 


GTC 


TCG 


GAC 


arg asn 


val 


thr 


ala 


phe 


gly 


ala 


pro 


glu 


his gly 


pro 


gly 


met 


pro 


arg 


val 


ser 


asp 


363/121 


















393/131 


















GGT CGG 


GCG 


CCA 


TCG 


ACG 


CCG 


GAC 


GAG 


GTC 


GCG GTG 


TCG 


AGC 


ACG 


CTG 


GGC 


CGA 


AAC 


CTC 


gly arg 


ala 


pro 


ser 


thr 


pro 


asp 


glu 


val 


ala val 


ser 


ser 


thr 


leu 


gly 


arg 


asn 


leu 


423/141 




































GGC GAC 


GAT 


C 
































gly asp 


asp 



































SEQ ID N0.8C 
FIGURE 8C 



part of the nucleotide sequence of seq8A 



1/1 31/11 



CAG GTT 


GCT 


CGT 


GCG 


CCT 


GGC 


GTC 


TTG 


GCT 


GCC GCC 


CCA 


CTA 


GCG 


ACT 


GCG 


CCG 


TCG 


ACG 


gin val 


ala 


arg 


ala 


pro 


gly 


val 


leu 


ala 


ala ala 


pro 


leu 


ala 


thr 


ala 


pro 


ser 


thr 


61/21 


















91/31 


















ATC CGG 


CAG 


GGC 


ACG 


TCA 


GCG 


CGA 


AAC 


GTC 


ACC GCG 


TTC 


GGG 


GCA 


CCA 


GAG 


CAC 


GGA 


CCC 


ile arg 


gin 


gly 


thr 


ser 


ala 


arg 


asn 


val 


thr ala 


phe 


gly 


ala 


pro 


glu 


his 


gly 


pro 


121/41 
















151/51 


















GGC ATG 


CCG 


CGG 


GTC 


TCG 


GAC 


GGT 


CGG 


GCG 


CCA TCG 


ACG 


CCG 


GAC 


GAG 


GTC 


GCG 


GTG 


TCG 


gly met 


pro 


arg 


val 


ser 


asp 


gly 


arg 


ala 


pro ser 


thr 


pro 


asp 


glu 


val 


ala 


val 


ser 


181/61 




































AGC ACG 


CTG 


GGC 


CGA 


AAC 


CTC 


GGC 


GAC 


GAT 


C 


















ser thr 


leu 


gly 


arg 


asn 


leu 


gly 


asp 


asp 





















SEQ ID No.8A f 
FIGURE 8A f 



REPLACEMENT SHEET (RULE 26) 



25/185 

sequence Rv2563 predicted by Cole et al . (Nature 393:537-544) and containing 
seq8A f 



atg 
met 
121/41 

ctt ttt gcg get ttg cgt gat 
leu phe ala ala leu arg asp 
181/61 

age acc ggc eta gtt ttc gcg 
ser thr gly leu val phe ala 
241/81 

gtc gag gec gag cga acc gtc 
val glu ala glu arg thr val 
301/101 

gcg gca gga ccg ttc ctg ggt 
ala ala gly pro phe leu gly 
361/121 

cgt gcg cct ggc gtc ttg get 
arg ala pro gly val leu ala 
421/141 

ggc acg tea gcg cga aac gtc 
gly thr ser ala arg asn val 
481/161 

egg gtc teg gac ggt egg gcg 
arg val ser asp gly arg ala 
541/181 

ggc cga aac etc ggc gac gat 
gly arg asn leu gly asp asp 
601/201 

gtg ccc gag tea acc gcg ctg 
val pro glu ser thr ala leu 
661/221 

cag cag ttg gca tac aac gga 
gin gin leu ala tyr asn gly 
721/241 

cga cag etc ccg gac ggc tat 
arg gin leu pro asp gly tyr 
781/261 

cgc ccg ttg aag gtc gcg gtg 
arg pro leu lys val ala val 
841/281 

gcg gcg ttg ate gtc ggc teg 
ala ala leu ile val gly ser 
901/301 

gcg gtg ttc aag gcg ate ggc 
ala val phe lys ala ile gly 
961/321 

gcg gtc gtc gtc gcg ctg etc 
ala val val val ala leu leu 
1021/341 

ccg ttg ttc ccg atg act gtc 
pro leu phe pro met thr val 
1081/361 

gcg act gtg ate ggt ctg ctg 
ala thr val ile gly leu leu 
1141/381 

ccg gca eta gcg ttc gga ggt 
pro ala leu ala phe gly gly 



151/51 

gtc caa tgg cga aaa cga 
val gin trp arg lys arg 

211/71 
atg acg etc gtt ctg acc 
met thr leu val leu thr 

271/91 
gat tec atg ggt gtc gac 
asp ser met gly val asp 

331/111 
teg aca cca ttc gec caa 
ser thr pro phe ala gin 

391/131 
gee gee cca eta gcg act 
ala ala pro leu ala thr 

451/151 
acc gcg ttc ggg gca cca 
thr ala phe gly ala pro 

511/171 . 
cca teg acg ccg gac gag 
pro ser thr pro asp glu 

571/191 
ctg caa gtg ggt gcg cgc 
leu gin val gly ala arg 

631/211 
gca aag att ccc aac ate 
ala lys ile pro asn ile 

691/231 
cag ccg aca ate agt teg 
gin pro thr ile ser ser 

751/251 
cag acc gtc aat cga gcg 
gin thr val asn arg ala 

811/271 
gat gcg ate acg gtt gtg 
asp ala ile thr val val 

871/291 
gtg gtc tac etc tct gcg 
val val tyr leu ser ala 

931/311 
gtg ccg acg cgc teg att 
val pro thr arg ser ile 

991/331 
gcg gcg gtg gtt ggc ggc 
ala ala val val gly gly 

1051/351 
gtg gta ccc ctg agt gec 
val val pro leu ser ala 

1111/371 
gee age gtc gca gga ctg 
ala ser val ala gly leu 

ccc tag 
pro AMB 

SEQ ID N0.8D 
FIGURE 8D 



cgc ctt gtc ate gca ate gtc 
arg leu val ile ala ile val 

gga ctt gtg aac ggg ttt egg 
gly leu val asn gly phe arg 

gca ttc gtg gtc aag gec ggc 
ala phe val val lys ala gly 

ate gac ctg ccc cag gtt get 
ile asp leu pro gin val ala 

gcg ccg teg acg ate egg cag 
ala pro ser thr ile arg gin 

gag cac gga ccc ggc atg ccg 
glu his gly pro gly met pro 

gtc gcg gtg teg age acg ctg 
val ala val ser ser thr leu 

act ttg egg ate gtc ggc ate 
thr leu arg ile val gly ile 

ttc ctg acc acc gaa ggc eta 
phe leu thr thr glu gly leu 

ate ggg ate gac ggg atg ccc 
ile gly ile asp gly met pro 

gat get gtc age gat ctg atg 
asp ala val ser asp leu met 

gcg gtc ttg ctg tgg ate gtt 
ala val leu leu trp ile val 

ttg gag egg ctg cgt gac ttt 
leu glu arg leu arg asp phe 

ctg gee ggg ctg gcg ctg cag 
leu ala gly leu ala leu gin 

ate ctt teg ctg ctg ttg gcg 
ile leu ser leu leu leu ala 

ttc gtg gcg eta ccg gcg ate 
phe val ala leu pro ala ile 

egg cgc gtg gtg gcg ate gat 
arg arg val val ala ile asp 



REPLACEMENT SHEET (RULE 26) 



26/185 

ORF predicted by Cole et al . (Nature 393:537-544) and containing Rv2563 
1/1 31/11 

tag gtt tea aga agg cct gtg cag gtt tec gca gec tgg gec gcg gcg cca ccg aag age 
AMB val ser arg arg pro val gin val ser ala ala trp ala ala ala pro pro lys ser 
61/21 91/31 

ccg ccg aaa tgg get aat egg gtt cgc ttg get cga teg ccg atg ate teg acc gee acg 
pro pro lys trp ala asn arg val arg leu ala arg ser pro met ile ser thr ala thr 
121/41 151/51 

acc gac ccc etc acc teg gtc gaa cct egg cga acc aac gcg gca acg cca gec cat gat 
thr asp pro leu thr ser val glu pro arg arg thr asn ala ala thr pro ala his asp 
181/61 211/71 

cat ttg att ggg tec acg gaa gca ggt age ttc cgt cgc atg ctt ttt gcg get ttg cgt 
his leu ile gly ser thr glu ala gly ser phe arg arg met leu phe ala ala leu arg 
241/81 271/91 

gat gtc caa tgg cga aaa cga cgc ctt gtc ate gca ate gtc age acc ggc eta gtt ttc 
asp val gin trp arg lys arg arg leu val ile ala ile val ser thr gly leu val phe 
301/101 331/111 

gcg atg acg etc gtt ctg acc gga ctt gtg aac ggg ttt egg gtc gag gee gag cga acc 
ala met thr leu val leu thr gly leu val asn gly phe arg val glu ala glu arg thr 
361/121 391/131 

gtc gat tec atg ggt gtc gac gca ttc gtg gtc aag gee ggc gcg gca gga ccg ttc ctg 
val asp ser met gly val asp ala phe val val lys ala gly ala ala gly pro phe leu 
421/141 451/151 

ggt teg aca cca ttc gee caa ate gac ctg ccc cag gtt get cgt gcg cct ggc gtc ttg 
gly ser thr pro phe ala gin ile asp leu pro gin val ala arg ala pro gly val leu 
481/161 511/171 

get gec gec cca eta gcg act gcg ccg teg acg ate egg cag ggc acg tea gcg cga aac 
ala ala ala pro leu ala thr ala pro ser thr ile arg gin gly thr ser ala arg asn 
541/181 571/191 

gtc acc gcg ttc ggg gca cca gag cac gga ccc ggc atg ccg egg gtc teg gac ggt egg 
val thr ala phe gly ala pro glu his gly pro gly met pro arg val ser asp gly arg 
601/201 631/211 

gcg cca teg acg ccg gac gag gtc gcg gtg teg age acg ctg ggc cga aac etc ggc gac 
ala pro ser thr pro asp glu val ala val ser ser thr leu gly arg asn leu gly asp 
661/221 691/231 

gat ctg caa gtg ggt gcg cgc act ttg egg ate gtc ggc ate gtg ccc gag tea acc gcg 
asp leu gin val gly ala arg thr leu arg ile val gly ile val pro glu ser thr ala 
721/241 751/251 

ctg gca aag att ccc aac ate ttc ctg acc acc gaa ggc eta cag cag ttg gca tac aac 
leu ala lys ile pro asn ile phe leu thr thr glu gly leu gin gin leu ala tyr asn 
781/261 811/271 

gga cag ccg aca ate agt teg ate ggg ate gac ggg atg ccc cga cag etc ccg gac ggc 
gly gin pro thr ile ser ser ile gly ile asp gly met pro arg gin leu pro asp gly 
841/281 871/291 

tat cag acc gtc aat cga gcg gat get gtc age gat ctg atg cgc ccg ttg aag gtc gcg 
tyr gin thr val asn arg ala asp ala val ser asp leu met arg pro leu lys val ala 
901/301 931/311 

gtg gat gcg ate acg gtt gtg gcg gtc ttg ctg tgg ate gtt gcg gcg ttg ate gtc ggc 
val asp ala ile thr val val ala val leu leu trp ile val ala ala leu ile val gly 
961/321 991/331 

teg gtg gtc tac etc tct gcg ttg gag egg ctg cgt gac ttt gcg gtg ttc aag gcg ate 
ser val val tyr leu ser ala leu glu arg leu arg asp phe ala val phe lys ala ile 
1021/341 1051/351 

ggc gtg ccg acg cgc teg att ctg gee ggg ctg gcg ctg cag gcg gtc gtc gtc gcg ctg 
gly val pro thr arg ser ile leu ala gly leu ala leu gin ala val val val ala leu 
1081/361 1111/371 

etc gcg gcg gtg gtt ggc ggc ate ctt teg ctg ctg ttg gcg ccg ttg ttc ccg atg act 
leu ala ala val val gly gly ile leu ser leu leu leu ala pro leu phe pro met thr 
1141/381 1171/391 

gtc gtg gta ccc ctg agt gec ttc gtg gcg eta ccg gcg ate gcg act gtg ate ggt ctg 
val val val pro leu ser ala phe val ala leu pro ala ile ala thr val ile gly leu 
1201/401 1231/411 

ctg gee age gtc gca gga ctg egg cgc gtg gtg gcg ate gat ccg gca eta gcg ttc gga 
leu ala ser val ala gly leu arg arg val val ala ile asp pro ala leu ala phe gly 
1261/421 
ggt ccc tag 
gly pro AMB 

SEQ ID N0.8F 
FIGURE 8F 



REPLACEMENT SHEET (RULE 26) 



27/185 



sequence of Rv0072 predicted by Cole et al . (Nature 393:537-544) and exhibiting 
more than 11% similarity with Seq8D' 



1/1 


















31/11 


















atg etc 


ttc 


gcg 


gec 


ctg 


cgt 


gac 


atg 


caa 


tgg aga 


aag 


cgc 


cgc 


ctg 


gtc 


ate 


acg 


ate 


Met leu 


phe 


ala 


ala 


leu 


arg 


asp 


met 


gin 


trp arg 


lys 


arg 


arg 


leu 


val 


ile 


thr 


ile 


61/21 


















91/31 


















ate age 


acc 


ggg 


ctg 


ate 


ttc 


ggg 


atg 


acg 


ctt gtt 


ttg 


acc 


gga 


etc 


gcg 


aac 


ggc 


ttc 


ile ser 


thr 


gly 


leu 


ile 


phe 


gly 


met 


thr 


leu val 


leu 


thr 


gly 


leu 


ala 


asn 


gly 


pne 


121/41 


















151/51 


















egg gtg 


gag 


gec 


egg 


cac 


acc 


gtc 


gat 


tec 


atg ggt 


gtc 


gat 


gta 


ttc 


gtc 


gtc 


aga 


tec 


arg val 


glu 


ala 


arg 


his 


thr 


val 


asp 


ser 


met gly 


val 


asp 


val 


pne 


val 


val 


arg 


ser 


181/61 
















211/71 


















ggc get 


get 


gga 


cct 


ttt 


ctg 


ggt 


tea 


ata 


ccg ttt 


ccc 


gat 


gtt 


gac 


ctg 


gec 


cga 


gtg 


gly ala 


ala 


gly 


pro 


phe 


leu 


gly 


ser 


ile 


pro phe 


pro 


asp 


val 


asp 


leu 


ai a 


arg 


vai 


241/81 


















271/91 


















gec get 


gaa 


ccc 


ggt 


gtc 


atg 


gee 


gcg 


gee 


ccg ttg 


ggc 


age 


gtg 


ggg 


acg 


ate 


atg 


aaa 


ala ala 


glu 


pro 


gly 


val 


met 


ala 


ala 


ala 


pro leu 


gly 


ser 


val 


gly 


unr 


lie 


me t 


lys 


301/101 














331/111 


















gaa ggc 


acg 


teg 


acg 


cga 


aac 


gtc 


acg 


gtc 


ttc ggc 


gcg 


ccc 


gag 


cac 


gga 


cct 


ggc 


atg 


glu gly 


thr 


ser 


thr 


arg 


asn 


val 


thr 


val 


phe gly 


ala 


pro 


glu 


nis 


gly 


pro 


gly 


met 


361/121 


















391/131 


















cca egg 


gtc 


tea 


gag 


ggt 


egg 


tea 


ccg 


teg 


aaa ccg 


gac 


gaa 


gtc 


gcg 


gca 


teg 


age 


acg 


pro arg 


val 


ser 


glu 


gly 


arg 


ser 


pro 


ser 


lys pro 


asp 


glu 


val 


ala 


ala 


ser 


ser 


4- U r- 
Lill 


421/141 


















4 51/151 


















atg ggc 


cga 


cac 


etc 


ggt 


gac 


act 


gtc 


gag 


gtc ggc 


gcg 


cgc 


aga 


ttg 


egg 


gtc 


gtt 


ggc 


met gly 


arg 


his 


leu 


gly 


asp 


thr 


val 


glu 


val gly 


ala 


arg 


a rg 


leu 


arg 


va 1 


val 


gly 


481/161 


















511/171 


















att gtg 


ccg 


aat 


tec 


acc 


gcg 


ctg 


gee 


aag 


ate ccc 


aat 


gtc 


ttc 


etc 


acg 


acc 


gag 


ggc 


ile val 


pro 


asn 


ser 


thr 


ala 


leu 


ala 


lys 


ile pro 


asn 


val 


pne 


leu 


tnr 


enr 


glu 


gly 


541/181 
















571 / 1 9 1 


















tta cag 


aaa 


ttg 


gcg 


tac 


aac 


ggg 


cag 


ccg 


aat ate 


acg 


tec 


ate 


ggg 


ate 


ata 


ggt 


atg 


leu gin 


lys 


leu 


ala 


tyr 


asn 


gly 


gin 


pro 


asn ile 


tnr 


ser 


He 


gly 


ne 


lie 


gly 


me t 


601/201 
















631/ 211 


















ccc cga 


cag 


ctg 


ccg 


gag 


ggt 


tac 


cag 


act 


ttc gat 


egg 


gtg 


ggc 


g ct 


gtc 


aat 


gat 


ttg 


pro arg 


gin 


leu 


pro 


glu 


gly 


tyr 


gin 


thr 


phe asp 


arg 


val 


gly 


ala 


va i 


asn 


asp 


leu 


661/221 
















691/231 


















gtg cgc 


cca 


ttg 


aag 


gtc 


gca 


gtg 


aat 


teg 


ate teg 


ate 


gtg 


get 


gtt 


ttg 


ctg 


tgg 


att 


val arg 


pro 


leu 


lys 


val 


ala 


val 


asn 


ser 


ile ser 


He 


val 


ala 


vai 


leu 


leu 


trp 


lie 


721/241 
















751/251 


















gtg gcg 


gtg 


ctg 


ate 


gtc 


ggc 


teg 


gtg 


gtg 


tac ctt 


teg 


get 


ctt 


gag 


egg 


eta 


cgt 


gac 


val ala 


val 


leu 


ile 


val 


gly 


ser 


val 


val 


tyr leu 


ser 


ala 


leu 


glu 


a r g 


leu 


arg 


asp 


781/261 


















811/271 


















ttc gcg 


gtg 


ttc 


aag 


gcg 


att 


ggc 


acg 


cca 


acg cgc 


teg 


acc 


acg 


gec 


ggg 


etc 


g ca 


f- a 


phe ala 


val 


phe 


lys 


ala 


ile 


gly 


tnr 


pro 


thr arg 


ser 


ne 


me t 


did 


gly 




ala 


leu 


841/281 


















o / i / <z y ± 


















cag gcg 


ctg 


gtc 


"3 +- f- 


gcg 


t~ t* rr 

ttg 


ctt 




y *-y 


y u y y u y 




gtc 


gtc 


ctg 


oca 

z) Z) 


caa 


qtq 


ttg 


gin ala 


leu 


val 


ile 


ala 


leu 


leu 


ala 


ala 


val val 


gly 


val 


val 


leu 


ala 


gin 


val 


leu 


901/301 


















931/311 


















gca cca 


ctg 


ttt 


ccg 


atg 


att 


gtc 


gcg 


gta 


ccc gtc 


ggt 


get 


tac 


ctg 


gcg 


eta 


ccg 


gtg 


ala pro 


leu 


phe 


pro 


met 


ile 


val 


ala 


val 


pro val 


gly 


ala 


tyr 


leu 


ala 


leu 


pro 


val 


961/321 


















991/331 


















gec gcg 


ate 


gtc 


ate 


ggt 


ctg 


ttc 


get 


agt 


gtt gee 


gga 


ttg 


aag 


cgc 


gtg 


gtg 


acg 


gtc 


ala ala 


ile 


val 


ile 


gly 


leu 


phe 


ala 


ser 


val ala 


gly 


leu 


lys 


arg 


val 


val 


thr 


val 


1021/341 


































gat ccc 


gcg 


cag 


gcg 


ttc 


gga 


ggt 


ccc 


tag 




















asp pro 


ala 


gin 


ala 


phe 


gly 


gly 


pro 


AMB 





















SEQ ID N0.8G 
FIGURE 8G 



REPLACEMENT SHEET (RULE 26) 



28/185 



SeqBH : ORF predicted by Cole et al . (Nature 393:537-544) and containing seq8G 



1/1 

1/ 1 


















O 1 / 1 1 

J 1 / 1 1 


















tag cct 


ctg 


gga 


a tg 


etc 


ttc 


gcg 


gee 


ctg 


cgt gac 


atg 


caa 


tgg 


aga 


aag 


cgc 


cgc 


ctg 


AMB pro 


leu 


gly 


met 


leu 


pne 


ala 


ala 


leu 


arg asp 


met 


gin 


trp 


arg 


lys 


arg 


arg 


leu 




















Q 1 / O 1 

y 1/ 3i 
















gtc ate 


acg 


ate 


ate 


age 


acc 


ggg 


ctg 


ate 


ttc ggg 


atg 


acg 


ctt 


gtt 


ttg 


acc 


gga 


etc 


val ile 


thr 


ile 


i xe 


ser 


4_ V> v- 

tnr 


gly 


leu 


ile 


phe gly 


met 


thr 


leu 


val 


leu 


thr 


gly 


leu 


1 O 1 //II 

Iz X / 4 1 


















1 C 1 /CI 

lol/ ol 
















gcg aac 


ggc 


ttc 


egg 


gtg 


gag 


gee 


egg 


cac 


acc gtc 


gat 


tec 


atg 


ggt 


gtc 


gat 


gta 


ttc 


ala asn 


gly phe 


arg 


val 


glu 


ala 


arg. 


ni s 


thr val 


asp 


ser 


met 


gly 


val 


asp 


val 


phe 


1 Q 1 /CI 

X O X / D X 


















Oil /T1 

Z 1 1 / / I 














gtc gtc 


aga 


tec 


ggc 


get 


get 


gga 


cct 


ttt 


ctg ggt 


tea 


ata 


ccg 


ttt 


ccc 


gat 


gtt 


gac 


val val 


arg 


ser 


gly 


ax a 


ala 

axa 


gly 


pro 


phe 


leu gly 


ser 


lie 


pro 


pne 


pro 


asp 


val 


asp 


9/11 /pi 
Z *i J. / OX 


















9 *7 1 / Q 1 

z / x / yi 
















ctg gec 


cga 


gtg 


gee 


get 


gaa 


ccc 


ggt 


gtc 


atg gec 


gcg 


gee 


ccg 


ttg 


ggc 


age 


gtg 


ggg 


leu ala 


arg 


Val 


a l a 
a x a 


a l a 
ax a 


glu 


pro 


gly 


t ra 1 

va x 


me t ala 


axa 


ala 


pro 


leu 


gly 


ser 


val 


gly 


Tni /inn 

Jul/ 1U1 


















OOI / I 1 1 

jjI/ XXX 
















acg ate 


a tg 


aaa 


gaa 


ggc 


acg 


teg 


acg 


cga 


aac gtc 


acg 


gtc 


ttc 


ggc 


gcg 


ccc 


gag 


cac 


unr lie 


met 


lys 


glu 


gly 


fr- h> r- 

tnr 


ser 


t nr 


a rg 


asn val 


tnr 


val 


pne 


gly 


ala 


pro 


glu 


his 


0{:i / 1 9 1 

obi/ X z X 


















OQ1 / 1 O 1 

o y x / ioi 


















gga cct 


ggc 


a tg 


cca 


egg 


gtc 


tea 


gag 


ggt 


egg tea 


ccg 


teg 


aaa 


ccg 


gac 


gaa 


gtc 


gcg 


gly pro 


gly 


me t 


p r o 


a rg 


U3 i 
vax 


ser 


glu 


gly 


arg ser 


pro 


ser 


lys 


pro 


asp 


glu 


val 


ala 


4 9 1 / 1 A 1 
4 Z X / 1 1 X 


















A C 1 / 1 C T 

4 31/ X 0 X 


















gca teg 


age 


acg 


acg 


ggc 


cga 


cac 


etc 


ggt 


gac act 


gtc 


gag 


gtc 


ggc 


gcg 


cgc 


aga 


ttg 


ala ser 


ser 


un r 


me t 


gly 


a r g 


hi s 


leu 


gly 


asp thr 


val 


gxu 


val 


gly 


ala 


arg 


arg 


leu 


/I Q 1 / 1 i 
H. O X / X D X 


















ci i / i *7 i 
Oil/ X /X 


















egg gtc 


gtt 


ggc 


att 


gtg 


ccg 


aat 


tec 


acc 


gcg ctg 


gee 


aag 


ate 


ccc 


aat 


gtc 


ttc 


etc 


arg val 


vax 


gly 


x x e 


va x 


pro 


asn 


ser 


tnr 


ala leu 


ala 

axa 


lys 


: i « 

ne 


pro 


asn 


val 


pne 


leu 


C/11 /i pi 
J^l/ X O X 


















CT1 /I Q1 

o / x / i y i 


















acg acc 


gag 


ggc 


tta 


cag 


aaa 


ttg 


gcg 


tac 


aac ggg 


cag 


ccg 


aat 


ate 


acg 


tec 


ate 


ggg 


tnr tnr 


glu 


gly 


leu 


gin 


1 ys 


leu 


axa 


tyr 


asn gly 


gXn 


pro 


asn 


ile 


thr 


ser 


ile 


gly 


cm / 9 m 

DUX/ Z U X 


















coi / o 1 1 
DO 1/ Z 1 1 


















ate ata 


ggt 


at g 


ccc 


cga 


cag 


ctg 


ccg 


gag 


ggt tac 


cag 


act 


ttc 


gat 


egg 


gtg 


ggc 


get 


ile ile 


gly 


met 


pro 


arg 


gin 


leu 


pro 


glu 


gly tyr 


gin 


tnr 


phe 


asp 


a rg 


val 


gly 


ala 


DDI/ ZZ 1 


















CQ1 /OOI 

by i / z o i 


















gtc aat 


gat 


l. l. y 


gtg 


cgc 


cca 


ttg 


aag 


gtc 


gca gtg 


aat 


teg 


ate 


teg 


ate 


gtg 


get 


gtt 


val asn 


asp 


leu 


Tra 1 

vax 


a rg 


pro 


leu 


lys 


vax 


ala val 


asn 


ser 


He 


ser 


-: i — 

lie 


val 


ala 


val 


TOI / O /I 1 

/ Z X / Z 4 X- 


















n m /9m 

/OX/ Z D X 


















ttg ctg 


tgg 


att 


gtg 


gcg 


gtg 


ctg 


ate 


gtc 


ggc teg 


gtg 


gtg 


tac 


ctt 


teg 


get 


ctt 


gag 


leu leu 


trp 


ile 


vax 


a x a 


ira 1 

vax 


leu 


x X e 


val 


gly ser 


val 


vax 


tyr 


leu 


ser 


ala 


leu 


glu 


"7 ft 1 / 9 1 
/ O X / Z O X 


















pi i / 9 "7 1 

o x x / z /x 


i 
















egg eta 


cgt 


gac 


tec 


gcg 


gtg 


etc 


aag 


gcg 


att ggc 


acg 


cca 


acg 


cgc 


teg 


att 


atg 


gec 


arg leu 


arg 


asp 


phe 


ala 

ax a 


i ra 1 

va x 


phe 


lys 


ala 


ile gly 


thr 


pro 


tnr 


arg 


ser 


lie 


met 


ala 


841/281 


















871/291 


















ggg etc 


gca 


tta 


cag 


gcg 


ctg 


gtc 


aLL 


gcg 


ttg ctt 


gcg 


gcg 


gtg 


gtg 


ggc 


gtc 


gtc 


ctg 


gly leu 


ala 


leu 


gin 


ala 


leu 


val 


ile 


ala 


leu leu 


ala 


ala 


val 


val 


gly 


val 


val 


leu 


901/301 


















931/311 


















gcg cag 


gtg 


ttg 


gca 


cca 


ctg 


ttt 


ccg 


atg 


att gtc 


gcg 


gta 


ccc 


gtc 


ggt 


get 


tac 


ctg 


ala gin 


val 


leu 


ala 


pro 


leu 


phe 


pro 


met 


ile val 


ala 


val 


pro 


val 


gly 


ala 


tyr 


leu 


961/321 


















991/331 


















gcg eta 


ccg 


gtg 


gee 


gcg 


ate 


gtc 


ate 


ggt 


ctg ttc 


get 


agt 


gtt 


gec 


gga 


ttg 


aag 


cgc 


ala leu 


pro 


val 


ala 


ala 


ile 


val 


ile 


gly 


leu phe 


ala 


ser 


val 


ala 


gly 


leu 


lys 


arg 


1021/341 
















1051/351 
















gtg gtg 


acg 


gtc 


gat 


ccc 


gcg 


cag 


gcg 


ttc 


gga ggt 


ccc 


tag 














val val 


thr 


val 


asp 


pro 


ala 


gin 


ala 


phe 


gly gly 


pro 


AMB 















SEQ ID N0.8H 

FIGURE 8H 
REPLACEMENT SHEET (RULE 26) 



29/185 



31/11 



CGA GGC 


CGA 


GCG 


AAC 


CGT 


CGA 


TTC 


CAT 


c 


ibl L bA 


LbL 


A TT 
A. 1 1 


LL 1 


LL 1 


L.MjH. 


CCC 


CCC 
LLL 


LLL 


arg gly 


arg 


ala 


asn 


arg 


arg 


phe 


nis 


gly 


cys arg 


arg 


lie 


a rg 


qly 


gin 


gly 


arg 


a rg 


61/21 


















9 1/ i 1 


















GGC AGL 


ALL 


/— rp rp 


PPT 
LL i 


LLL 


TTP 


c ap 

L/\L 


Apr 
M.LL 


ATT 


prp ppn 


A AT 
nnl 


ppa 


PPT 


CCC 


ppn 




TGC 


TCG 


gly arg 


thr 


val 


pro 


gly 


K n 

pne 


asp 


+- K v- 

tnr 


ne 


arg pro 


asn 


a r g 


pro 


a i a 


pro 


gly 


cys 


ser 


12 1/4 1 




































TGC GCC 


TGG 


CGT 


CTT 


GGC 


TGC 


CGC 


CCC 


ACT 


AGC GAC 


TGC 


GCC 


GTC 


GAC 


GAT 


CCG 


GCA 


GGG 


cys ala 


trp 


arg 


leu 


gly 


cys 


arg 


pro 


thr 


ser asp 


cys 


ala 


val 


asp 


asp 


pro 


ala 


gly 


181/61 


















211/71 


















CAC GTC 


AGC 


GCG 


AAA 


CGT 


CAC 


CGC 


GTT 


CGG 


GGC ACC 


AGA 


GCA 


CGG 


ACC 


CGG 


CAT 


GCC 


GCG 


his val 


ser 


ala 


lys 


arg 


his 


arg 


val 


arg 


gly thr 


arg 


ala 


arg 


thr 


arg 


his 


ala 


ala 


241/81 


















271/91 


















GGT CTC 


GGA 


CGG 


TCG 


GGC 


GCC 


ATC 


GAC 


GCC 


GGA CGA 


GGT 


CGC 


GGT 


GTC 


GAG 


CAC 


GCT 


GGG 


gly leu 


gly 


arg 


ser 


gly 


ala 


ile 


asp 


ala 


gly arg 


gly 


arg 


gly 


val 


glu 


his 


ala 


gly 


301/101 




































CCG AAA 


CCT 


CGG 


CGA 


CGA 


TC 


























pro lys 


pro 


arg 


arg 


arg 










































SEQ ID No.9A 
































FIGURE 9A 




































32/11 


















GAG GCC 


GAG 


CGA 


ACC 


Li L 


LAI 


J. LL 


ATG 


GGT 


GTC GAC 


GCA 


TTP 


Kd 1 \J 






CCC 


GGC 


GCG 


glu ala 


glu 


arg 


tnr 


val 


asp 


ser 


met 


gly 


val asp 


a X a 


phe 


V dJL 


Vet J- 


lys 


ala 


gly 


ala 


62 12 1 


















92/31 


















bCA LLA 


LLL 


rnrnf* 
1 1L 




PPT 


X \^\J 


APA 


CCA 


TTC 


GCC CAA 


ATC 


GAC 


CTG 


ccc 


GAG 


GTT 


GCT 


CGT 


ala gly 


pro 


phe 


leu 


gly 


ser 


t-hr 

L.I1 J. 


pro 


phe 


ala gin 


ile 


asp 


leu 


pro 


gin 


va 1 


ala 


a r g 




















152/51 


















CCC CCT 


GGC 


GTC 


TTG 


GCT 


GCC 


GCC 


CCA 


CTA 


GCG ACT 


GCG 


CCG 


TCG 


ACG 


ATC 


CGG 


CAG 


GGC 


ala pro 


gly 


val 


leu 


ala 


ala 


ala 


pro 


leu 


ala thr 


ala 


pro 


ser 


thr 


ile 


arg 


gin 


gly 


182/61 
















212/71 


















ACG TCA 


GCG 


CGA 


AAC 


GTC 


ACC 


GCG 


TTC 


GGG 


GCA CCA 


GAG 


CAC 


GGA 


CCC 


GGC 


ATG 


CCG 


CGG 


thr ser 


ala 


arg 


asn 


val 


thr 


ala 


phe 


gly 


ala pro 


glu 


his 


gly 


pro 


gly 


met 


pro 


arg 


242/81 


















272/91 
















GTC TCG 


GAC 


GGT 


CGG 


GCG 


CCA 


TCG 


ACG 


CCG 


GAC GAG 


GTC 


GCG 


GTG 


TCG 


AGC 


ACG 


CTG 


GGC 


val ser 


asp 


gly 


arg 


ala 


pro 


ser 


thr 


pro 


asp glu 


val 


ala 


val 


ser 


ser 


thr 


leu 


gly 


302/101 




































CGA AAC 


CTC 


GGC 


GAC 


GAT 


C 


























arg asn 


leu 


gly 


asp 


asp 





























SEQ ID No.9B 
FIGURE 9B 



REPLACEMENT SHEET (RULE 26) 




30/185 



33/11 



AGG CCG 


AGC 


GAA 


CCG 


TCG 


ATT 


CCA 


TGG 


GTG 


TCG ACG 


CAT 


TCG 


TGG 


TCA 


AGG 


CCG 


GCG 


CGG 


arg pro 


ser 


glu 


pro 


ser 


ile 


pro 


trp 


val 


ser thr 


his 


ser 


trp 


ser 


arg 


pro 


ala 


arg 


63/21 


















93/31 


















CAG GAC 


CGT 


TCC 


TGG 


GTT 


CGA 


CAC 


CAT 


TCG 


CCC AAA 


TCG 


ACC 


TGC 


CCC 


AGG 


TTG 


CTC 


GTG 


gin asp 


arg 


ser 


trp 


val 


arg 


his 


his 


ser 


pro lys 


ser 


thr 


cys 


pro 


arg 


leu 


leu 


val 


123/41 


















153/51 


















CGC CTG 


GCG 


TCT 


TGG 


CTG 


CCG 


CCC 


CAC 


TAG 


CGA CTG 


CGC 


CGT 


CGA 


CGA 


TCC 


GGC 


AGG 


GCA 


arg leu 


ala 


ser 


trp 


leu 


pro 


pro 


his 


AMB 


arg leu 


arg 


arg 


arg 


arg 


ser 


gly 


arg 


ala 


183/61 


















213/71 


















CGT CAG 


CGC 


GAA 


ACG 


TCA 


CCG 


CGT 


TCG 


GGG 


CAC CAG 


AGC 


ACG 


GAC 


CCG 


GCA 


TGC 


CGC 


GGG 


arg gin 


arg 


glu 


thr 


ser 


pro 


arg 


ser 


gly 


his gin 


ser 


thr 


asp 


pro 


ala 


cys 


arg 


gly 


243/81 


















273/91 


















TCT CGG 


ACG 


GTC 


GGG 


CGC 


CAT 


CGA 


CGC 


CGG 


ACG AGG 


TCG 


CGG 


TGT 


CGA 


GCA 


CGC 


TGG 


GCC 


ser arg 


thr 


val 


gly 


arg 


his 


arg 


arg 


arg 


thr arg 


ser 


arg 


cys 


arg 


ala 


arg 


trp 


ala 



303/101 

GAA ACC TCG GCG ACG ATC- 
glu thr. ser ala thr ile 



SEQ ID No.9C 
FIGURE 9C 



31/11 



TTA ACG 


ACT 


CAG 


ACG 


GAA 


ACG 


CTT 


GAA 


CCG 


CGA GGT 


CGC 


TCC 


GGA 


CAC 


CAA 


TTT 


GAC 


TCG 


leu thr 


thr 


gin 


thr 


glu 


thr 


leu 


glu 


pro 


arg gly 


arg 


ser 


gly 


his 


gin 


phe 


asp 


ser 


61/21 


















91/31 


















GCT CTT 


TGG 


CAA 


TTG 


AAG 


GTG 


AGC 


TGC 


GAG 


CAG CCG 


GGT 


GAC 


CGC 


ATC 


GTT 


GGC 


CTT 


GCC 


ala leu 


trp 


gin 


leu 


lys 


val 


ser 


cys 


glu 


gin pro 


gly 


asp 


arg 


ile 


val 


gly 


leu 


ala 


121/41 


















151/51 


















ATC AAT 


CGC 


CGG 


CTC 


GCG 


GAC 


GTA 


GAT 


AAT 


CAG CTC 


ACC 


GTT 


GGG 


ACC 


GAC 


CTC 


GAC 


CAG 


ile asn 


arg 


arg 


leu 


ala 


asp 


val 


asp 


asn 


gin leu 


thr 


val 


gly 


thr 


asp 


leu 


asp 


gin 


181/61 


















211/71 


















GGG TCC 


TTT 


GTG 


ACT 


GCC 


GGG 


CTT 


GAC 


GCG 


GAC GAC 


CAC 


AGA 


GTC 


GGT 


CAT 


CGC 


CTA 


AGG 


gly ser 


phe 


val 


thr 


ala 


gly 


leu 


asp 


ala 


asp asp 


his 


arg 


val 


gly 


his 


arg 


leu 


arg 


241/81 


















271/91 


















CTA CCG 


TTC 


TGA 


CCT 


GGG 


GCT 


GCG 


TGG 


GCG 


CCG ACG 


ACG 


TGA 


GGC 


ACG 


TCA 


TGT 


CTC 


AGC 


leu pro 


phe 


OPA 


pro 


gly 


ala 


ala 


trp 


ala 


pro thr 


thr 


OPA 


gly 


thr 


ser 


cys 


leu 


ser 


301/101 


















331/111 


















GGC CCA 


CCG 


CCA 


CCT 


CGG 


TCG 


CCG 


GCA 


GTA 


TGT CAG 


CAT 


GTG 


CAG 


ATG 


ACT 


CCA 


CGC 


AGC 


gly pro 


pro 


pro 


pro 


arg 


ser 


pro 


ala 


val 


cys gin 


his 


val 


gin 


met 


thr 


pro 


arg 


ser 


361/121 


















391/131 


















CTT GTT 


CGC 


ATC 


GTT 


GGT 


GTC 


GTG 


GTT 


GCG 


ACG ACC 


TTG 


GCG 


CTG 


GTG 


AGC 


GCA 


CCC 


GCC 


leu val 


arg 


ile 


val 


gly 


val 


val 


val 


ala 


thr thr 


leu 


ala 


leu 


val 


ser 


ala 


pro 


ala 


421/141 




































GGC .GGT 


CGT 


GCC 


GCG 


CAT 


GCG 


GAT 


C 






















gly gly 


arg 


ala 


ala 


his 


ala 


asp 

























SEQ ID No.lOA 
FIGURE 10A 



REPLACEMENT SHEET (RULE 26) 




31/185 



32/11 



TAA CGA 


CTC 


AGA 


CGG 


AAA 


CGC 


TTG 


AAC 


CGC 


GAG GTC 


GCT 


CCG 


GAC 


ACC 


AAT 


TTG 


ACT 


CGG 


OCH arg 


leu 


arg 


arg 


lys 


arg 


leu 


asn 


arg 


glu val 


ala 


pro 


asp 


thr 


asn 


leu 


thr 


arg 


62/21 


















92/31 


















CTC TTT 


GGC 


AAT 


TGA 


AGG 


TGA 


GCT 


GCG 


AGC 


AGC CGG 


GTG 


ACC 


GCA 


TCG 


TTG 


GCC 


TTG 


CCA 


leu phe 


gly 


asn 


OPA 


arg 


OPA 


ala 


ala 


ser 


ser arg 


val 


thr 


ala 


ser 


leu 


ala 


leu 


pro 


122/41 
















152/51 


















TCA ATC 


GCC 


GGC 


TCG 


CGG 


ACG 


TAG 


ATA 


ATC 


AGC TCA 


CCG 


TTG 


GGA 


CCG 


ACC 


TCG 


ACC 


AGG 


ser ile 


ala 


gly 


ser 


arg 


thr 


AMB 


ile 


ile 


ser ser 


pro 


leu 


gly 


pro 


thr 


ser 


thr 


arg 


182/61 
















212/71 


















GGT CCT 


TTG 


TGA 


CTG 


CCG 


GGC 


TTG 


ACG 


CGG 


ACG ACC 


ACA 


GAG 


TCG 


GTC 


ATC 


GCC 


TAA 


r* 


gly pro 


leu 


OPA 


leu 


pro 


gly 


leu 


thr 


arg 


thr thr 


thr 


glu 


ser 


val 


ile 


ala 


OCH 


gly 


242/81 


















272/91 


















TAC CGT 


TCT 


GAC 


CTG 


GGG 


CTG 


CGT 


GGG 


CGC 


CGA CGA 


CGT 


GAG 


GCA 


CGT 


CAT 


GTC 


TCA 


GCG 


tyr arg 


ser 


asp 


leu 


gly 


leu 


arg 


gly 


arg 


arg arg 


arg 


glu 


ala 


arg 


his 


val 


ser 


ala 


302/101 


















332/111 


















GCC CAC 


CGC 


GAC 


CTC 


GGT 


CGC 


CGG 


CAG 


TAT 


GTC AGC 


ATG 


TGC 


AGA 


TGA 


CTC 


CAC 


GCA 


GCC 


ala his 


arg 


his 


leu 


gly 


arg 


arg 


gin 


tyr 


val ser 


met 


cys 


arg 


OPA 


leu 


his 


ala 


ala 


362/121 


















392/131 


















TTG TTC 


GCA 


TCG 


TTG 


GTG 


TCG 


TGG 


TTG 


CGA 


CGA CCT 


TGG 


CGC 


TGG 


TGA 


GCG 


CAC 


CCG 


CCG 


leu phe 


ala 


ser 


leu 


val 


ser 


trp 


leu 


arg 


arg pro 


trp 


arg 


trp 


OPA 


ala 


his 


pro 


pro 



422/141 

GCG GTC GTG CCG CGC ATG CGG ATC 
ala val val pro arg met arg Ile 

SEQ ID No.lOB 
FIGURE 10B 





















33/11 


















AAC GAC 


TCA 


GAC 


GGA 


AAC 


GCT 


TGA 


ACC 


GCG 


AGG TCG 


CTC 


CGG 


ACA 


CCA 


ATT 


TGA 


CTC 


GGC 


asn asp 


ser 


asp 


gly 


asn 


ala 


OPA 


thr 


ala 


arg ser 


leu 


arg 


thr 


pro 


ile 


OPA 


leu 


gly 


63/21 
















93/31 


















TCT TTG 


GCA 


ATT 


GAA 


GGT 


GAG 


CTG 


CGA 


GCA 


GCC GGG 


TGA 


CCG 


CAT 


CGT 


TGG 


CCT 


TGC 


CAT 


ser leu 


ala 


ile 


glu 


gly 


glu 


leu 


arg 


ala 


ala gly 


OPA 


pro 


his 


arg 


trp 


pro 


cys 


his 


123/41 
















153/51 


















CAA TCG 


CCG 


GCT 


CGC 


GGA 


CGT 


AGA 


TAA 


TCA 


GCT CAC 


CGT 


TGG 


GAC 


CGA 


CCT 


CGA 


CCA 


GGG 


gin ser 


pro 


ala 


arg 


gly 


arg 


arg 


OCH 


ser 


ala his 


arg 


trp 


asp 


arg 


pro 


arg 


pro 


gly 


183/61 
















213/71 


















GTC CTT 


TGT 


GAC 


TGC 


CGG 


GCT 


TGA 


CGC 


GGA 


CGA CCA 


CAG 


AGT 


CGG 


TCA 


TCG 


CCT 


AAG 


GCT 


val leu 


cys 


asp 


cys 


arg 


ala 


OPA 


arg 


gly 


arg pro 


gin 


ser 


arg 


ser 


ser 


pro 


lys 


ala 


243/81 
















273/91 


















ACC GTT 


CTG 


ACC 


TGG 


GGC 


TGC 


GTG 


GGC 


GCC 


GAC GAC 


GTG 


AGG 


CAC 


GTC 


ATG 


TCT 


CAG 


CGG 


thr val 


leu 


thr 


trp 


gly 


cys 


val 


gly 


ala 


asp asp 


val 


arg 


his 


val 


met 


ser 


gin 


arg 


303/101 


















333/111 
















CCT 


CCC ACC 


GCC 


ACC 


TCG 


GTC 


GCC 


GGC 


AGT 


ATG 


TCA GCA 


TGT 


GCA 


GAT 


GAC 


TCC 


ACG 


CAG 


pro thr 


ala 


thr 


ser 


val 


ala 


gly 


ser 


met 


ser ala 


cys 


ala 


asp 


asp 


ser 


thr 


gin 


pro 


363/121 


















393/131 
















CGG 


TGT TCG 


CAT 


CGT 


TGG 


TGT 


CGT 


GGT 


TGC 


GAC 


GAC CTT 


GGC 


GCT 


GGT 


GAG 


CGC 


ACC 


CGC 


cys ser 


his 


arg 


trp 


cys 


arg 


gly 


cys 


asp 


asp leu 


gly 


ala 


gly 


glu 


arg 


thr 


arg 


arg 


423/141 




































CGG TCG 


TGC 


CGC 


GCA 


TGC 


GGA 


TC 
























arg ser 


cys 


arg 


ala 


cys 


gly 



























SEQ ID No. IOC 
FIGURE IOC 



REPLACEMENT SHEET (RULE 26) 




32/185 



31/11 



CCC GAA 


GAG 


GTC 


CCC 


CGT 


TTT 


GTT 


AAT 


TTT 


TAA AAA 


ATT 


TGT 


GTC 


ACA 


AAC 


CGG 


GGT 


ACC 


pro glu 


glu 


val 


pro 


arg 


phe 


val 


asp 


phe 


OCH lys 


ile 


cys 


val 


thr 


lys 


arg 


gly 


thr 


61/21 


















91/31 


















AAG GCA 


TAA 


AAC 


CTA 


GTA 


CCT 


GGG 


GCG 


GCG 


GAT TCA 


ACG 


AAA 


ACC 


GAG 


TGG 


GGG 


TAG 


TCA 


lys ala 


OCH 


asn 


leu 


val 


pro 


gly 


ala 


ala 


asp ser 


thr 


lys 


thr 


glu 


trp 


gly AMB 


ser 


121/41 


















151/51 


















GGG GCG 


TGC 


ATT 


CCG 


ACG 


ACC 


CTG 


TAC 


GAC 


CCG CTG 


GTG 


GCA 


ACG 


CCG 


ATG 


AGT 


GCG 


CCG 


gly ala 


cys 


ile 


pro 


thr 


thr 


leu 


tyr 


asp 


pro leu 


val 


ala 


thr 


pro 


met 


ser 


ala 


pro 


181/61 


















211/71 


















ACG AAG 


GCC 


GAG 


CGA 


CGG 


GCT 


GCC 


GGC 


GCT 


GAC CGC 


CGC 


GGA 


AGC 


CGC 


CGA 


GTG 


CAT 


GGT 


thr lys 


ala 


glu 


arg 


arg 


ala 


ala 


gly 


ala 


asp arg 


arg 


gly 


ser 


arg 


arg 


val 


asp 


gly 


241/81 


















271/91 


















CAC CAC 


CGC 


CCG 


CAC 


CCG 


ACC 


GGT 


ACG 


GAT 


CGC GCC 


TCG 


GGT 


TAC 


CGT 


CGC 


CGT 


CAA 


CGC 


his his 


arg 


pro 


his 


pro 


thr 


gly 


thr 


asp 


arg ala 


ser 


gly 


tyr 


arg 


arg 


arg 


gin 


arg 


301/101 


















331/111 


















GCT GGA 


CAG 


CAT 


CGG 


TCC 


CCG 


CTG 


GGT 


CAA 


TGC ACT 


CAT 


GCA 


GCG 


CCG 


CAA 


CGA 


ACA 


GCT 


ala gly 


gin 


his 


arg 


ser 


pro 


leu 


gly 


gin 


cys thr 


his 


ala 


ala 


pro 


gin 


arg 


thr 


ala 


361/121 


















391/131 


















CAA CCC 


TTG 


AAC 


CGG 


GTC 


CCG 


GCC 


TGC 


CGA 


CCC TCG 


GCC 


GCC 


GGC 


GTG 


CCG 


CTA 


CGT 


GAT 


gin pro 


leu 


asn 


arg 


val 


pro 


ala 


cys 


arg 


pro ser 


ala 


ala 


gly 


val 


pro 


leu 


arg 


asp 


421/141 


















451/151 


















AGA CAC 


AGG 


GCC 


ATG 


GAA 


ATC 


CTG 


GCC 


AGC 


CGG ATG 


CTA 


CTT 


CGG 


CCG 


GCG 


GAC 


TAT 


CAG 


arg his 


arg 


ala 


met 


glu 


ile 


leu 


ala 


ser 


arg met 


leu 


leu 


arg 


pro 


ala 


asp 


tyr 


gin 


481/161 




































CGG TCG 


CTG 


AGC 


TTC 


TAC 


CGT 


GAC 


CAG 


ATC 




















arg ser 


leu 


ser 


phe 


tyr 


arg 


asp 


gin 


ile 




































SEQ ID No.llA 






























FIGURE 


11A 


































32/11 


















CCG AAG 


AGG 


TCC 


CCC 


GTT 


TTG 


TTA 


ATT 


TTT 


AAA AAA 


TTT 


GTG 


TCA 


CAA 


AGC 


GGG 


GTA 


CCA 


pro lys 


arg 


ser 


pro 


val 


leu 


leu 


ile 


phe 


lys lys 


phe 


val 


ser 


gin 


ser 


gly val 


pro 


62/21 


















92/31 


















AGG CAT 


AAA 


ACC 


TAG 


TAC 


CTG 


GGG 


CGG 


CGG 


ATT CAA 


CGA 


AAA 


CCG 


AGT 


GGG 


GGT 


AGT 


CAG 


arg his 


lys 


thr 


AMB 


tyr 


leu 


gly 


arg 


arg 


ile gin 


arg 


lys 


pro 


ser 


gly 


gly 


ser 


gin 


122/41 


















152/51 


















GGG CGT 


GCA 


TTC 


CGA 


CGA 


CCC 


TGT 


ACG 


ACC 


CGC TGG 


TGG 


CAA 


CGC 


CGA 


TGA 


GTG 


CCC 


CGA 


gly arg 


ala 


phe 


arg 


arg 


pro 


cys 


thr 


thr 


arg trp 


trp 


gin 


arg 


arg 


OPA 


val 


arg 


arg 


182/61 


















212/71 


















CGA AGG 


ccG 


AGC 


GAC 


GGG 


CTG 


CCG 


GCG 


CTG 


ACC GCC 


GCG 


GAA 


GCC 


GCC 


GAG 


TGG 


ATG 


GTC 


arg arg 


pro 


ser 


asp 


gly 


leu 


pro 


ala 


leu 


thr ala 


ala 


glu 


ala 


ala 


glu 


trp 


met 


val 


242/81 


















272/91 


















ACC ACC 


GCC 


CGC 


ACC 


CGA 


CCG 


GTA 


CGG 


ATC 


GCG CCT 


CGG 


GTT 


ACC 


GTC 


GCC 


GTC 


ACC 


GCG 


thr thr 


ala 


arg 


thr 


arg 


pro 


val 


arg 


ile 


ala pro 


arg 


val 


thr 


val 


ala 


val 


asn 


ala 


302/101 


















332/111 


















CTG GAC 


AGC 


ATC 


GGT 


CCC 


CGC 


TGG 


GTC 


AAT 


GCA CTC 


ATG 


CAG 


CGC 


CGC 


AAC 


GAA 


CAG 


CTC 


leu asp 


ser 


ile 


gly 


pro 


arg 


trp 


val 


asn 


ala leu 


met 


gin 


arg 


arg 


asn 


glu 


gin 


leu 


362/121 


















392/131 


















AAC CCT 


TGA 


ACC 


GGG 


TCC 


CGG 


CCT 


GCC 


GAC 


CCT CGG 


CCG 


CCG 


GCG 


TGC 


CGC 


TAC 


GTG 


ATA 


asn pro 


OPA 


thr 


gly 


ser 


arg 


pro 


ala 


asp 


pro arg 


pro 


pro 


ala 


cys 


arg 


tyr 


val 


ile 


422/141 


















452/151 


















GAC ACA 


GGG 


CCA 


TGG 


AAA 


TCC 


TGG 


CCA 


GCC 


GGA TGC 


TAC 


TTC 


GCC 


CGG 


CGG 


ACT 


ATC 


AGC 


asp thr 


gly 


pro 


trp 


lys 


ser 


trp 


pro 


ala 


gly cys 


tyr 


phe 


gly 


arg 


arg 


thr 


ile 


ser 


482/161 




































GGT CGC 


TGA 


GCT 


TCT 


ACC 


GTG 


ACC 


AGA 


TC 




















gly arg 


OPA 


ala 


ser 


thr 


val 


thr 


arg 























SEQ ID No.llB 
FIGURE 11B 



REPLACEMENT SHEET (RULE 26) 




33/185 



33/11 



CGA AGA 


GGT 


CCC 


CCG 


TTT 


TGT 


TAA 


TTT 


TTA 


AAA AAT 


TTG 


TGT 


CAC 


AAA 


GCG 


GGG 


TAC 


CAA 


arg arg 


gly 


pro 


pro 


phe 


cys 


OCH 


phe 


leu 


lys asn 


leu 


cys 


his 


lys 


ala 


gly 


tyr 


gin 


63/21 


















93/31 


















GGC ATA 


AAA 


CCT 


AGT 


ACC 


TGG 


GGC 


GGC 


GGA 


TTC AAC 


GAA 


AAC 


CGA 


GTG 


GGG 


GTA 


GTC 


AGG 


gly ile 


lys 


pro 


ser 


thr 


trp 


gly 


gly 


gly 


phe asn 


glu 


asn 


arg 


val 


gly 


val 


val 


arg 


123/41 


















153/51 


















GGC GTG 


CAT 


TCC 


GAC 


GAC 


CCT 


GTA 


CGA 


CCC 


GCT GGT 


GGC 


AAC 


GCC 


GAT 


GAG 


TGC 


GCC 


GAC 


gly val 


his 


ser 


asp 


asp 


pro 


val 


arg 


pro 


ala gly 


gly 


asn 


ala 


asp 


glu 


cys 


ala 


asp 


183/61 


















213/71 


















GAA GGC 


CGA 


GCG 


ACG 


GGC 


TGC 


CGG 


CGC 


TGA 


CCG CCG 


CGG 


AAG 


CCG 


CCG 


AGT 


GGA 


TGG 


TCA 


glu gly 


arg 


ala 


thr 


gly 


cys 


arg 


arg 


OPA 


pro pro 


arg 


lys 


pro 


pro 


ser 


gly 


trp 


ser 


243/81 


















273/91 


















CCA CCG 


CCC 


GCA 


CCC 


GAC 


CGG 


TAC 


GGA 


TCG 


CGC CTC 


GGG 


TTA 


CCG 


TCG 


CCG 


TCA 


ACG 


CGC 


pro pro 


pro 


ala 


pro 


asp 


arg 


tyr 


gly 


ser 


arg leu 


gly 


leu 


pro 


ser 


pro 


ser 


thr 


arg 


303/101 
















333/111 


















TGG ACA 


GCA 


TCG 


GTC 


CCC 


GCT 


GGG 


TCA 


ATG 


CAC TCA 


TGC 


AGC 


GCC 


GCA 


ACG 


AAC 


AGC 


TCA 


trp thr 


ala 


ser 


val 


pro 


ala 


gly 


ser 


met 


his ser 


cys 


ser 


ala 


ala 


thr 


asn 


ser 


ser 


363/121 


















393/131 


















ACC CTT 


GAA 


CCG 


GGT 


CCC 


GGC 


CTG 


CCG 


ACC 


CTC GGC 


CGC 


CGG 


CGT 


GCC 


GCT 


ACG 


TGA 


TAG 


thr leu 


glu 


pro 


gly 


pro 


gly 


leu 


pro 


thr 


leu gly 


arg 


arg 


arg 


ala 


ala 


thr 


OPA 


AMB 


4 7 ^ / 1 41 

*i Z. J / J- *± X 
















453/151 


















ACA CAG 


GGC 


CAT 


GGA 


AAT 


CCT 


GGC 


CAG 


CCG 


GAT GCT 


ACT 


TCG 


GCC 


GGC 


GGA 


CTA 


TCA 


GCG 


thr gin 


gly 


his 


gly 


asn 


pro 


gly 


gin 


pro 


asp ala 


thr 


ser 


ala 


gly 


gly 


leu 


ser 


ala 


483/161 




































GTC GCT 


GAG 


CTT 


CTA 


CCG 


TGA 


CCA 


GAT 


C 




















\;2 "1 3 1 a 
V d _L CI J- CL 


glu 


leu 


leu 


pro 


OPA 


pro 


asp 




































SEQ ID 


No. 11C 






























FIGURE 11C 


















part of 


the 


nucleotide sequence 


of 


Seqll 


















1/1 


















31/11 


















CGT CGC 


CGT 


CAA 


CGC 


GCT 


GGA 


CAG 


CAT 


CGG 


TCC CCG 


CTG 


GGT 


CAA 


TGC 


ACT 


CAT 


GCA 


GCG 


arg arg 


arg 


gin 


arg 


ala 


gly 


gin 


his 


arg 


ser pro 


leu 


gly 


gin 


cys 


thr 


his 


ala 


ala 


61/21 


















91/31 
















GGC 


CCG CAA 


CGA 


ACA 


GCT 


CAA 


CCC 


TTG 


AAC 


CGG 


GTC CCG 


GCC 


TGC 


CGA 


CCC 


TCG 


GCC 


GCC 


pro gin 


arg 


thr 


ala 


gin 


pro 


leu 


asn 


arg 


val pro 


ala 


cys 


arg 


pro 


ser 


ala 


ala 


gly 


121/41 
















151/51 
















CGG 


GTG CCG 


CTA 


CGT 


GAT 


AGA 


CAC 


AGG 


GCC 


ATG 


GAA ATC 


CTG 


GCC 


AGC 


CGG 


ATG 


CTA 


CTT 


val pro 


leu 


arg 


asp 


arg 


his 


arg 


ala 


met 


glu ile 


leu 


ala 


ser 


arg 


met 


leu 


leu 


arg 


181/61 


















211/71 


















CCG GCG 


GAC 


TAT 


CAG 


CGG 


TCG 


CTG 


AGC 


TTC 


TAC CGT 


GAC 


CAG 


ATC 












pro ala 


asp 


tyr 


gin 


arg 


ser 


leu 


ser 


phe 


tyr arg 


asp 


gin 


ile 













SEQ ID No.llA' 
FIGURE 11A 1 



REPLACEMENT SHEET (RULE 26) 



• 



34/185 





GTC 


AAC 


GCG 


CTG 


GAC 


AGC 


ATC 


GGT 


CCC CGC 


TGG 


GTC 


AAT 


GCA 


CTC 


ATG 


CAG 


CGC 


val ala 


val 


asn 


ala 


leu 


asp 


ser 


ile 


gly 


pro arg 


trp 


val 


asn 


ala 


leu 


met 


gin 


arg 


61/21 


















91/31 






















CAG 


CTC 


AAC 


CCT 


TGA 


ACC 


GGG 


TCC CGG 


CCT 


GCC 


GAC 


CCT 


CGG 


CCG 


CCG 


GCG 


arg asn 


glu 


gin 


leu 


asn 


pro 


OPA 


thr 


gly 


ser arg 


pro 


ala 


asp 


pro 


arg 


pro 


pro 


ala 


121/41 


















151/51 




















TAC 


GTG 


ATA 


GAC 


ACA 


GGG 


CCA 


TGG 


AAA TCC 


TGG 


CCA 


GCC 


GGA 


TGC 


TAC 


TTC 


GGC 


cys arg 


tyr 


val 


ile 


asp 


thr 


gly 


pro 


trp 


lys ser 


trp 


pro 


ala 


gly 


cys 


tyr 


phe 


gly 


181/61 


















211/71 


















CGG CGG 


ACT 


ATC 


AGC 


GGT 


CGC 


TGA 


GCT 


TCT 


ACC GTG 


ACC 


AGA 


TC 












arg arg 


thr 


ile 


ser 


gly 


arg 


OPA 


ala 


ser 


thr val 


thr 


arg 




























SEQ 


ID No 


.HB f 






























FIGURE llB f 
















1/1 


















31/11 


















TCG CCG 


TCA 


ACG 


CGC 


TGG 


ACA 


GCA 


TCG 


GTC 


CCC GCT 


GGG 


TCA 


ATG 


CAC 


TCA 


TGC 


AGC 


GCC 


ser pro 


ser 


thr 


arg 


trp 


thr 


ala 


ser 


val 


pro ala 


gly 


ser 


met 


his 


ser 


cys 


ser 


ala 


61/21 


















91/31 


















GCA ACG 


AAC 


AGC 


TCA 


ACC 


CTT 


GAA 


CCG 


GGT 


CCC GGC 


CTG 


CCG 


ACC 


CTC 


GGC 


CGC 


CGG 


CGT 


ala thr 


asn 


ser 


ser 


thr 


leu 


glu pro 


gly 


pro gly 


leu 


pro 


thr 


leu 


gly 


arg 


arg 


arg 


121/41 


















151/51 


















GCC GCT 


ACG 


TGA 


TAG 


ACA 


CAG 


GGC 


CAT 


GGA 


AAT CCT 


GGC 


CAG 


CCG 


GAT 


GCT 


ACT 


TCG 


GCC 


ala ala 


thr 


OPA 


AMB 


thr 


gin 


gly his 


gly 


asn pro 


gly 


gin 


pro 


asp 


ala 


thr 


ser 


ala 


181/61 


















211/71 


















GGC GGA 


CTA 


TCA 


GCG 


GTC 


GCT 


GAG 


CTT 


CTA 


CCG TGA 


CCA 


GAT 


C 












gly gly 


leu 


ser 


ala 


val 


ala 


glu 


leu 


leu 


pro OPA 


pro 


asp 






























SEQ 


ID No 


. lie 















FIGURE llC f 



sequence Rv054 6c predicted by Cole et al . (Nature 393:537-54 4) and containing SeqllA' 



1/1 


















31/11 


















atg gaa 


ate 


ctg 


gec 


age 


egg 


atg 


eta 


ctt 


egg ccg 


gcg 


gac 


tat 


cag 


egg 


teg 


ctg 


age 


Met glu 


ile 


leu 


ala 


ser 


arg 


met 


leu 


leu 


arg pro 


ala 


asp 


tyr 


gin 


arg 


ser 


leu 


ser 


61/21 


















91/31 


















ttc tac 


cgt 


gac 


cag 


ate 


ggg 


ctg 


gcg 


att 


gec cgt 


gaa 


tac 


ggg 


gee 


ggc 


aca 


gtg 


ttt 


phe tyr 


arg 


asp 


gin 


ile 


gly 


leu 


ala 


ile 


ala arg 


glu 


tyr 


gly 


ala 


gly 


thr 


val 


phe 


121/41 


















151/51 


















ttc gec 


ggt 


cag 


tea 


ctg 


etc 


gaa 


ctg 


gec 


ggt tac 


ggc 


gag 


ccg 


gac 


cat 


teg 


egg 


gga 


phe ala 


gly 


gin 


ser 


leu 


leu 


glu 


leu 


ala 


gly tyr 


gly 


glu 


pro 


asp 


his 


ser 


arg 


gly 


181/61 


















211/71 


















cct ttt 


ccc 


ggc 


gcg 


ctg 


tgg 


ctg 


cag 


gtg 


cgc gac 


etc 


gag 


get 


acc 


cag 


acc 


gag 


ctg 


pro phe 


pro 


gly 


ala 


leu 


trp 


leu 


gin 


val 


arg asp 


leu 


glu 


ala 


thr 


gin 


thr 


glu 


leu 


241/81 


















271/91 


















gtc age 


cga 


ggc 


gtg 


teg 


ate 


get 


cgc 


gag 


ccc cgc 


cgc 


gaa 


ccg 


tgg 


ggc 


ctg 


cac 


gag 


val ser 


arg 


gly 


val 


ser 


ile 


ala 


arg 


glu 


pro arg 


arg 


glu 


pro 


trp 


gly 


leu 


his 


glu 


301/101 
















331/111 


















atg cat 


gtg 


acc 


gac 


cca 


gac 


ggg 


ate 


aca 


ctg ata 


ttc 


gtc 


gag 


gtt 


ccc 


gag 


ggt 


cac 


met his 


val 


thr 


asp 


pro 


asp 


gly 


ile 


thr 


leu ile 


phe 


val 


glu 


val 


pro 


glu 


gly 


his 


361/121 




































ccg ctg 


cgt 


aca 


gac 


acc 


egg 


gcg 


tga 






















pro leu 


arg 


thr 


asp 


thr 


arg 


ala 


OPA 























SEQ ID No.llD 



FIGURE 11D 
REPLACEMENT SHEET (RULE 26) 



35/185 



ORF predicted by Cole et al . (Nature 393:537-544) and containing Rv0546c 



1/1 


















31/11 


















tag tea 


ggg 


cgt 


gca 


ttc 


gac 


gac 


get 


gta 


eta ccc 


get 


ggt 


ggc 


aac 


tec 


gat 


gat 


tgc 


AMB ser 


gly 


arg 


ala 


phe 


asp 


asp 


ala 


val 


leu pro 


ala 


gly 


gly 


asn 


ser 


asp 


asp 


cys 


61/21 


















91/31 


















gec gac 


gaa 


ggc 


eta 


cga 


egg 


get 


gee 


ggc 


get gac 


cgc 


cgc 


gga 


age 


cgc 


cga 


gtg 


gat 


ala asp 


glu 


gly 


leu 


arg 


arg 


ala 


ala 


gly 


ala asp 


arg 


arg 


gly 


ser 


arg 


arg 


val 


asp 


121/41 


















151/51 


















ggt cac 


cgc 


cgc 


ccg 


cac 


ccg 


acc 


ggt 


gcg 


gat cgc 


gee 


teg 


ggt 


tgc 


cgt 


cgc 


cgt 


caa 


gly his 


arg 


arg 


pro 


his 


pro 


thr 


gly 


ala 


asp arg 


ala 


ser 


gly 


cys 


arg 


arg 


arg 


gin 


181/61 


















211/71 


















cgc get 


gga 


cag 


cat 


egg 


tec 


ccg 


ctg 


ggt 


caa tgc 


act 


cat 


gca 


gcg 


ccg 


caa 


cga 


aca 


arg ala 


gly 


gin 


his 


arg 


ser 


pro 


leu 


gly 


gin cys 


thr 


nis 


ala 


ala 


pro 


gin 


arg 


4- 1_ -w- 

tnr 


241/81 


















O "7 1 / Q 1 

i. / 1/ y i 


















get caa 


ccc 


ttg 


aac 


egg 


gtc 


ccg 


gee 


tgc 


cga ccc 


teg 


gee 


gee 


ggc 


gtg 


ccg 


cna 


cgt 


ala gin 


pro 


leu 


asn 


arg 


val 


pro 


ala 


cys 


arg pro 


ser 


ala 


ala 


gly 


val 


pro 


leu 


a rg 


301/101 


















OQ1 / 1 1 T 

6 6 1/ 111 


















gat aga 


cac 


agg 


gee 


atg 


gaa 


ate 


ctg 


gee 


age egg 


atg 


eta 


ctt 


egg 


ccg 


gcg 


gac 


tac 


asp arg 


his 


arg 


ala 


met 


glu 


ile 


leu 


ala 


ser arg 


met 


i ... 
leu 


leu 


arg 


pro 


aia 


asp 


tyr 


361/121 




































cag egg 


teg 


ctg 


age 


ttc 


tac 


cgt 


gac 


cag 


ate ggg 


ctg 


gcg 


att 


gec 


cgt 


gaa 


tac 


ggg 


gin arg 


ser 


leu 


ser 


phe 


tyr 


arg 


asp 


gin 


ile gly 


leu 


ala 


lie 


aia 


arg 


glu 


tyr 


gly 


421/141 


















4 Ol/ lOl 


















gee ggc 


aca 


gtg 


4-4-4- 
ttt 




gee 


ggt 


cag 






gaa 


ctg 


gec 


y y i- 


tac 


n n r* 


y ay 


ccg 


ala gly 


thr 


val 


phe 


phe 


ala 


gly 


gin 


ser 


leu leu 


glu 


leu 


ala 


gly 


tyr 


gly 


glu 


pro 


481/161 


















511/171 


















gac cat 


teg 


egg 


gga 


cct 


ttt 


ccc 


ggc 


gcg 


ctg tgg 


ctg 


cag 


gtg 


cgc 


gac 


etc 


gag 


get 


asp his 


ser 


arg 


gly 


pro 


phe 


pro 


gly 


ala 


leu trp 


leu 


gin 


val 


arg 


asp 


leu 


glu 


ala 


541/181 


















571/191 


















ace cag 


acc 


gag 


ctg 


gtc 


age 


cga 


ggc 


gtg 


teg ate 


get 


cgc 


gag 


ccc 


cgc 


cgc 


gaa 


ccg 


thr gin 


thr 


glu 


leu 


val 


ser 


arg 


gly 


val 


ser ile 


ala 


arg 


glu 


pro 


arg 


arg 


glu 


pro 


601/201 


















631/211 


















tgg ggc 


ctg 


cac 


gag 


atg 


cat 


gtg 


acc 


gac 


cca gac 


ggg 


ate 


aca 


ctg 


ata 


ttc 


gtc 


gag 


trp gly 


leu 


his 


glu 


met 


his 


val 


thr 


asp 


pro asp 


gly 


ile 


thr 


leu 


ile 


phe 


val 


glu 


661/221 


















691/231 


















gtt ccc 


gag 


ggt 


cac 


ccg 


ctg 


cgt 


aca 


gac 


acc egg 


gcg 


tga 














val pro 


glu 


gly 


his 


pro 


leu 


arg 


thr 


asp 


thr arg 


ala 


OPA 















SEQ ID No.llF 
FIGURE 11F 



REPLACEMENT SHEET (RULE 26) 




36/185 



1/1 


















31/1 1 


















gac cga 


agg 


gat 


ttc 


gcg 


act 


aac 


teg 


gcc 


tgt aag 


gca 


acg 


cga 


ggt 


ctt 


cat 


gcc 


gag 


asp arg 


arg 


asp 


phe 


ala 


thr 


asn 


ser 


ala 


cys lys 


ala 


tnr 


arg 


gly 


leu 


nis 


ala 


glu 


/on 
Ol/ ^ 1 


















3 1/ «J X 


















gac gta 


gac 


agg 


aag 


aga 


cag 


gga 


age 


tga 


tga cgt 


cgc 


gta 


ccg 


gac 


cgc 


cat 


tct 


gtc 


asp val 


asp 


arg 


lys 


arg 


gin 


gly 


ser 


OPA 


OPA arg 


arg 


val 


pro 


asp 


arg 


his 


ser 


val 


121/41 


















151/51 


















gag tct 


ttc 


cga 


gtt 


cag 


caa 


caa 


teg 


aca 


cag aag 


egg 


gga 


cca 


gac 


egg 


gag 


gac 


gac 


glu ser 


phe 


arg 


val 


gin 


gin 


gin 


ser 


thr 


gin lys 


arg 


gly 


pro 


asp 


arg 


glu 


asp 


asp 


181/61 


















211/71 


















gcg gcc 


egg 


gcc 


get 


teg 


ggc 


cga 


gtg 


tct 


gag taa 


gac 


cag 


agt 


cac 


ggg 


tec 


gtg 


tgt 


ala ala 


arg 


ala 


ala 


ser 


gly 


arg 


val 


ser 


glu OCH 


asp 


gin 


ser 


his 


gly 


ser 


val 


cys 


241/81 


















271/91 


















gac aac 


cgc 


gcg 


gaa 


ttc 


aat 


egg 


atg 


gcg 


ggc ggg 


acc 


gga 


ttg 


cgc 


egg 


tea 


ccg 


agg 


asp asn 


arg 


ala 


glu 


phe 


asn 


arg 


met 


ala 


gly gly 


thr 


gly 


leu 


arg 


arg 


ser 


pro 


arg 



301/101 

aac etc egg agt gat c 
asn leu arg ser asp 



SEQ ID No.l2A 
FIGURE 12A 



1/1 


















31/11 


















acc gaa 


ggg 


att 


teg 


cga 


eta 


act 


egg 


cct 


gta agg 


caa 


cgc 


gag 


gtc 


ttc 


atg 


ccg 


agg 


thr glu 


gly 


ile 


ser 


arg 


leu 


thr 


arg 


pro 


val arg 


gin 


arg 


glu 


val 


phe 


met 


pro 


arg 


61/21 


















91/31 


















acg tag 


aca 


gga 


aga 


gac 


agg 


gaa 


get 


gat 


gac gtc 


gcg 


tac 


egg 


acc 


gcc 


att 


ctg 


teg 


thr AMB 


thr 


gly 


arg 


asp 


arg 


glu 


ala 


asp 


asp val 


ala 


tyr 


arg 


thr 


ala 


ile 


leu 


ser 


121/41 


















151/51 


















agt ctt 


tec 


gag 


ttc 


age 


aac 


aat 


cga 


cac 


aga age 


ggg 


gac 


cag 


acc 


ggg 


agg 


acg 


acg 


ser leu 


ser 


glu 


phe 


ser 


asn 


asn 


arg 


his 


arg ser 


gly 


asp 


gin 


thr 


gly 


arg 


thr 


thr 


181/61 


















211/71 


















egg ccc 


ggg 


ccg 


ctt 


egg 


gcc 


gag 


tgt 


ctg 


agt aag 


acc 


aga 


gtc 


acg 


ggt 


ccg 


tgt 


gtg 


arg pro 


gly 


pro 


leu 


arg 


ala 


glu 


cys 


leu 


ser lys 


thr 


arg 


val 


thr 


gly 


pro 


cys 


val 


241/81 


















271/91 


















aca acc 


gcg 


egg 


aat 


tea 


ate 


gga 


tgg 


egg 


gcg gga 


ccg 


gat 


tgc 


gcc 


ggt 


cac 


cga 


gga 


thr thr 


ala 


arg 


asn 


ser 


ile 


gly 


trp 


arg 


ala gly 


pro 


asp 


cys 


ala 


gly 


his 


arg 


gly 



301/101 

acc tec gga gtg ate 
thr ser gly val ile 



SEQ ID No.l2B 
FIGURE 12B 



REPLACEMENT SHEET (RULE 26) 



37/185 



i/i 

ccg aag gga ttt cgc gac taa 
pro lys gly phe arg asp OCH 
61/21 

cgt aga cag gaa gag aca ggg 
arg arg gin glu glu .thr gly 
121/41 

gtc ttt ccg agt tea gca aca 
val phe pro ser ser ala thr 
181/61 

ggc ccg ggc cgc ttc ggg ccg 
gly pro gly arg phe gly pro 
241/81 

caa ccg cgc gga att caa teg 
gin pro arg gly ile gin ser 
301/101 

cct ccg gag tga tc 
pro pro glu OPA 









31/11 






etc 


ggc 


ctg 


taa ggc 


aac 


gcg 


leu 


gly 


leu 


OCH gly 


asn 


ala 








91/31 






aag 


ctg 


atg 


acg teg 


cgt 


acc 


lys 


leu 


met 


thr ser 


arg 


thr 








151/51 






ate 


gac 


aca 


gaa gcg 


ggg 


acc 


ile 


asp 


thr 


glu ala 


gly 


thr 








211/71 






agt 


gtc 


tga 


gta aga 


cca 


gag 


ser 


val 


OPA 


val arg 


pro 


glu 








271/91 






gat 


ggc 


ggg 


egg gac 


egg 


att 


asp 


gly 


gly 


arg asp 


arg 


ile 



SEQ ID No-12C 



agg tc-t tea tgc cga gga 
arg ser ser cys arg gly 

gga ccg cca ttc tgt cga 
gly pro pro phe cys arg 

aga ccg gga gga cga cgc 
arg pro gly gly arg arg 

tea egg gtc cgt gtg tga 
ser arg val arg val OPA 

gcg ccg gtc acc gag gaa 
ala pro val thr glu glu 



FIGURE 12C 

1/1 31/11 

GGG ATT TCG TTG CCC GAT GGA TTG TTT GTA CGG TTT GGG AAA AAC ACT TGA AGT CCT TTT 
gly ile ser leu pro asp gly leu phe val arg phe gly lys asn thr OPA ser pro phe 
61/21 91/31 

TAT TGG CAA TGC TGG AAA TGG ACA TTC CAA TAT TGC GCG AAT TAA CCG AAC ACG GTG AGG 
tyr trp gin cys trp lys trp thr phe gin tyr cys ala asn OCH pro asn thr val arg 
121/41 151/51 

GGG GGG CAA GCG TTT GTA CCG GGG CCA GCA AGC GCC GCC GAC CGG TTG ACC GAA GCC AGC 
gly gly gin ala phe val pro gly pro ala ser ala ala asp arg leu thr glu ala ser 
181/61 211/71 

ATG TTG TTG TGT CAG CGC GGG CTT GGT CTC GAT GTC CCG GCC TTG GCT GGA CCC GCT TCT 
met leu leu cys gin arg gly leu gly leu asp val pro ala leu ala gly pro ala ser 
241/81 271/91 

TCA AAA CAG GTT GAA CTT AAC GAC TCA AGA ACG GAA ACG CTT GAA CCG CGA CGT CGC TCC 
ser lys gin val glu leu asn asp ser arg thr glu thr leu glu pro arg arg arg ser 
301/101 331/111 

GGA CAC CAA TTT GAC TCG GCT CTT TGG CAA TTG AAG GTG AGC TGC GAG CAG CCG GGT GAC 
gly his gin phe asp ser ala leu trp gin leu lys val ser cys glu gin pro gly asp 
361/121 391/131 

CGC ATC GTT GGC CTT GCC ATC AAT CGC CGG CTC GCG GAC GTA GAT AAT CAG CTC ACC GTT 
arg ile val gly leu ala ile asn arg arg leu ala asp val asp asn gin leu thr val 
421/141 451/151 

GGG ACC GAC CTC GAC CAG GGG TCC TTT GTG ACT GCC GGG CTT GAC GCG GAC GAC CAC AGA 
gly thr asp leu asp gin gly ser phe val thr ala gly leu asp ala asp asp his arg 
481/161 511/171 

GTC GGT CAT CGC CTA AGG CTA CCG TTC TGA CCT GGG GCT GCG TGG GCG CCG ACG ACG TGA 
val gly his arg leu arg leu pro phe OPA pro gly ala ala trp ala pro thr thr OPA 
541/181 571/191 

GGC ACG TCA TGT CTC AGC GGC CCA CCG CCA CCT CGG TCG CCG GCA GTA TGT CAG CAT GTG 
gly thr ser cys leu ser gly pro pro pro pro arg ser pro ala val cys gin his val 
601/201 631/211 

CAG ATG ACT CCA CGC AGC CTT GTT CGC ATC GTT GGT GTC GTG GTT GCG ACG ACC TTG GCG 

gin met thr pro arg ser leu val arg ile val gly val val val ala thr thr leu ala 

661/221 691/231 

CTG GTG AGC GCA CCC GCC GGC GGT CGT GCC GCG CAT GCG GAT C 

leu val ser ala pro ala gly gly arg ala ala his ala asp 



SEQ ID No.l3A 
FIGURE 13A 



REPLACEMENT SHEET (RULE 26) 



38/185 



32/11 



GGA TTT 


CGT 


TGC 


CCG 


ATG 


GAT 


TGT 


TTG 


TAC 


GGT TTG 


GGA 


AAA 


ACA 


CTT 


GAA 


GTC 


CTT 


TTT 


gly phe 


arg 


cys 


pro 


met 


asp 


cys 


leu 


tyr 


gly leu 


gly 


lys 


thr 


leu 


glu 


val 


leu 


phe 


62/21 


















92/31 


















ATT GGC 


AAT 


GCT 


GGA 


AAT 


GGA 


CAT 


TCC 


AAT 


ATT GCG 


CGA 


ATT 


AAC 


CGA 


ACA 


CGG 


TGA 


GGG 


ile gly 


asn 


ala 


gly 


asn 


gly 


his 


ser 


asn 


ile ala 


arg 


ile 


asn 


arg 


thr 


arg 


OPA 


gly 


122/41 


















152/51 


















GGG GGC 


AAG 


CGT 


TTG 


TAC 


CGG 


GGC 


CAG 


CAA 


GCG CCG 


CCG 


ACC 


GGT 


TGA 


CCG 


AAG 


CCA 


GCA 


gly gly 


lys 


arg 


leu 


tyr 


arg 


gly 


gin 


gin 


ala pro 


pro 


thr 


gly 


OPA 


pro 


lys 


pro 


ala 


182/61 


















212/71 


















TGT TGT 


TGT 


GTC 


AGC 


GCG 


GGC 


TTG 


GTC 


TCG 


ATG TCC 


CGG 


CCT 


TGG 


CTG 


GAC 


CCG 


CTT 


CTT 


cys cys 


cys 


val 


ser 


ala 


gly 


leu 


val 


ser 


met ser 


arg 


pro 


trp 


leu 


asp 


pro 


leu 


leu 


242/81 


















272/91 


















CAA AAC 


AGG 


TTG 


AAC 


TTA 


ACG 


ACT 


CAA 


GAA 


CGG AAA 


CGC 


TTG 


AAC 


CGC 


GAC 


GTC 


GCT 


CCG 


gin asn 


arg 


leu 


asn 


leu 


thr 


thr 


gin 


glu 


arg lys 


arg 


leu 


asn 


arg 


asp 


val 


ala 


pro 


302/101 
















332/111 


















GAC ACC 


AAT 


TTG 


ACT 


CGG 


CTC 


TTT 


GGC 


AAT 


TGA AGG 


TGA 


GCT 


GCG 


AGC 


AGC 


CGG 


GTG 


ACC 


asp thr 


asn 


leu 


thr 


arg 


leu 


phe 


gly 


asn 


OPA arg 


OPA 


ala 


ala 


ser 


ser 


arg 


val 


thr 


362/121 


















392/131 


















GCA TCG 


TTG 


GCC 


TTG 


CCA 


TCA 


ATC 


GCC 


GGC 


TCG CGG 


ACG 


TAG 


ATA 


ATC 


AGC 


TCA 


CCG 


TTG 


ala ser 


leu 


ala 


leu 


pro 


ser 


ile 


ala 


gly 


ser arg 


thr 


AMB 


He 


He 


ser 


ser 


pro 


leu 


422/141 


















/ICO / -1 r- -| 

4 52/ 151 


















GGA CCG 


ACC 


TCG 


ACC 


AGG 


GGT 


CCT 


TTG 


TGA 


CTG CCG 


GGC 


TTG 


ACG 






ALL, 


ALA 




gly pro 


thr 


ser 


thr 


arg 


gly 


pro 


leu 


OPA 


leu pro 


gly 


leu 


thr 


arg 


tnr 


4- Vv *- 

tnr 


4- K v- 

tnr 


giu 


482/161 


















D Izl / 1 / 1 


















TCG GTC 


ATC 


GCC 


TAA 


GGC 


TAC 


CGT 


TCT 


GAC 


CTG GGG 


CTG 


CGT 


GGG 


CGC 


CGA 


CGA 


CGT 


GAG 


ser val 


ile 


ala 


OCH 


gly 


tyr 


arg 


ser 


asp 


leu gly 


leu 


arg 


gly 


arg 


arg 


arg 


arg 


glu 


542/181 


















572/191 


















GCA CGT 


CAT 


GTC 


TCA 


GCG 


GCC 


CAC 


CGC 


CAC 


CTC GGT 


CGC 


CGG 


CAG 


TAT 


GTC 


AGC 


ATG 


TGC 


ala arg 


his 


val 


ser 


ala 


ala 


his 


arg 


his 


leu gly 


arg 


arg 


gin 


tyr 


val 


ser 


met 


cys 


602/201 


















632/211 


















AGA TGA 


CTC 


CAC 


GCA 


GCC 


TTG 


TTC 


GCA 


TCG 


TTG GTG 


TCG 


TGG 


TTG 


CGA 


CGA 


CCT 


TGG 


CGC 


arg OPA 


leu 


his 


ala 


ala 


leu 


phe 


ala 


ser 


leu val 


ser 


trp 


leu 


arg 


arg 


pro 


trp 


arg 


662/221 


















692/231 


















TGG TGA 


GCG 


CAC 


CCG 


CCG 


GCG 


GTC 


GTG 


CCG 


CGC ATG 


CGG 


ATC 














trp OPA 


ala 


his 


pro 


pro 


ala 


val 


val 


pro 


arg met 


arg 


ile 















SEQ ID No.l3B 
FIGURE 13B 



REPLACEMENT SHEET (RULE 26) 



39/185 



33/11 



GAT TTC 


GTT 


GCC 


CGA 


TGG 


ATT 


GTT 


TGT 


ACG 


GTT TGG 


GAA 


AAA 


CAC 


TTG 


AAG 


TCC 


TTT 


TTA 


asp phe 


val 


ala 


arg 


trp 


ile 


val 


cys 


thr 


val trp 


glu 


lys 


his 


leu 


lys 


ser 


phe 


leu 


63/21 


















93/31 


















TTG GCA 


ATG 


CTG 


GAA 


ATG 


GAC 


ATT 


CCA 


ATA 


TTG CGC 


GAA 


TTA 


ACC 


GAA 


CAC 


GGT 


GAG 


GGG 


leu ala 


met 


leu 


glu 


met 


asp 


ile 


pro 


ile 


leu arg 


glu 


leu 


thr 


glu 


his 


gly 


glu 


gly 


123/41 


















153/51 


















GGG GCA 


AGC 


GTT 


TGT 


ACC 


GGG 


GCC 


AGC 


AAG 


CGC CGC 


CGA 


CCG 


GTT 


GAC 


CGA 


AGC 


CAG 


CAT 


gly ala 


ser 


val 


cys 


thr 


gly 


ala 


ser 


lys 


arg arg 


arg 


pro 


val 


asp 


arg 


ser 


gin 


his 


183/61 


















213/71 


















GTT GTT 


GTG 


TCA 


GCG 


CGG 


GCT 


TGG 


TCT 


CGA 


TGT CCC 


GGC 


CTT 


GGC 


TGG 


ACC 


CGC 


TTC 


TTC 


val val 


val 


ser 


ala 


arg 


ala 


trp 


ser 


arg 


cys pro 


gly 


leu 


gly 


trp 


thr 


arg 


phe 


phe 


243/81 


















273/91 


















AAA ACA 


GGT 


TGA 


ACT 


TAA 


CGA 


CTC 


AAG 


AAC 


GGA AAC 


GCT 


TGA 


ACC 


GCG 


ACG 


TCG 


CTC 


CGG 


lys thr 


gly 


OPA 


thr 


OCH 


arg 


leu 


lys 


asn 


gly asn 


ala 


OPA 


thr 


ala 


thr 


ser 


leu 


arg 


303/101 


















333/111 


















ACA CCA 


ATT 


TGA 


CTC 


GGC 


TCT 


TTG 


GCA 


ATT 


GAA GGT 


GAG 


CTG 


CGA 


GCA 


GCC 


GGG 


TGA 


CCG 


thr pro 


ile 


OPA 


leu 


gly 


ser 


leu 


ala 


ile 


glu gly 


glu 


leu 


arg 


ala 


ala 


gly 


OPA 


pro 


363/121 


















393/131 


















CAT CGT 


TGG 


CCT 


TGC 


CAT 


CAA 


TCG 


CCG 


GCT 


CGC GGA 


CGT 


AGA 


TAA 


TCA 


GCT 


CAC 


CGT 


TGG 


his arg 


trp 


pro 


cys 


his 


gin 


ser 


pro 


ala 


arg gly 


arg 


arg 


OCH 


ser 


ala 


his 


arg 


trp 


423/141 


















453/151 


















GAC CGA 


CCT 


CGA 


CCA 


GGG 


GTC 


CTT 


TGT 


GAC 


TGC CGG 


GCT 


TGA 


CGC 


GGA 


CGA 


CCA 


CAG 


AGT 


asp arg 


pro 


arg 


pro 


gly 


val 


leu 


cys 


asp 


cys arg 


ala 


OPA 


arg 


gly 


arg 


pro 


gin 


ser 


483/161 


















513/171 


















CGG TCA 


TCG 


CCT 


AAG 


GCT 


ACC 


GTT 


CTG 


ACC 


TGG GGC 


TGC 


GTG 


GGC 


GCC 


GAC 


GAC 


GTG 


AGG 


arg ser 


ser 


pro 


lys 


ala 


thr 


val 


leu 


thr 


trp gly 


cys 


val 


gly 


ala 


asp 


asp 


val 


arg 


543/181 


















573/191 


















CAC GTC 


ATG 


TCT 


CAG 


CGG 


CCC 


ACC 


GCC 


ACC 


TCG GTC 


GCC 


GGC 


AGT 


ATG 


TCA 


GCA 


TGT 


GCA 


his val 


met 


ser 


gin 


arg 


pro 


thr 


ala 


thr 


ser val 


ala 


gly 


ser 


met 


ser 


ala 


cys 


ala 


603/201 


















633/211 


















GAT GAC 


TCC 


ACG 


CAG 


CCT 


TGT 


TCG 


CAT 


CGT 


TGG TGT 


CGT 


GGT 


TGC 


GAC 


GAC 


CTT 


GGC 


GCT 


asp asp 


ser 


thr 


gin 


pro 


cys 


ser 


his 


arg 


trp cys 


arg 


gly 


cys 


asp 


asp 


leu 


gly 


ala 


663/221 


















693/231 


















GGT GAG 


CGC 


ACC 


CGC 


CGG 


CGG 


TCG 


TGC 


CGC 


GCA TGC 


GGA 


TC 














gly glu 


arg 


thr 


arg 


arg 


arg 


ser 


cys 


arg 


ala cys 


gly 

















SEQ ID No.l3C 
FIGURE 13C 



REPLACEMENT SHEET (RULE 26) 



40/185 



part of 


the 


nucleotide sequence 


of seql3A 


















1/1 


















31/11 


















GGG TCC 


TTT 


GTG 


ACT 


GCC 


GGG 


CTT 


GAC 


GCG 


GAC GAC 


CAC 


AGA 


GTC 


GGT 


CAT 


CGC 


CTA 


AGG 


gly ser 


phe 


val 


thr 


ala 


gly 


leu 


asp 


ala 


asp asp 


his 


arg 


val 


gly his 


arg 


leu 


arg 


61/21 
















91/31 


















CTA CCG 


TTC 


TGA 


CCT 


GGG 


GCT 


GCG 


TGG 


GCG 


CCG ACG 


ACG 


TGA 


GGC 


ACG 


TCA 


TGT 


CTC 


AGC 


leu pro 


phe 


OPA 


pro 


gly 


ala 


ala 


trp 


ala 


pro thr 


thr 


OPA gly 


thr 


ser 


cys 


leu 


ser 


121/41 
















151/51 


















GGC CCA 


CCG 


CCA 


CCT 


CGG 


TCG 


CCG 


GCA 


GTA 


TGT CAG 


CAT 


GTG 


CAG 


ATG 


ACT 


CCA 


CGC 


AGC 


gly pro 


pro 


pro 


pro 


arg 


ser 


pro 


ala 


val 


cys gin 


his 


val 


gin 


met 


thr 


pro 


arg 


ser 


181/61 


















211/71 


















CTT GTT 


CGC 


ATC 


GTT 


GGT 


GTC 


GTG 


GTT 


GCG 


ACG ACC 


TTG 


GCG 


CTG 


GTG 


AGC 


GCA 


CCC 


GCC 


leu val 


arg 


ile 


val 


gly 


val 


val 


val 


ala 


thr thr 


leu 


ala 


leu 


val 


ser 


ala 


pro 


ala 


241/81 




































GGC GGT 


CGT 


GCC 


GCG 


CAT 


GCG 


GAT 


C 






















gly gly 


arg 


ala 


ala 


his 


ala 


asp 

























SEQ ID No.l3A f 
FIGURE 13A T 



1/1 


















31/11 


















GGT CCT 


TTG 


TGA 


CTG 


CCG 


GGC 


TTG 


ACG 


CGG 


ACG ACC 


ACA 


GAG 


TCG 


GTC 


ATC 


GCC 


TAA 


GGC 


gly pro 


leu 


OPA 


leu 


pro 


gly 


leu 


thr 


arg 


thr thr 


thr 


glu 


ser 


val 


ile 


ala 


OCH 


gly 


61/21 


















91/31 


















TAC CGT 


TCT 


GAC 


CTG 


GGG 


CTG 


CGT 


GGG 


CGC 


CGA CGA 


CGT 


GAG 


GCA 


CGT 


CAT 


GTC 


TCA 


GCG 


tyr arg 


ser 


asp 


leu 


gly 


leu 


arg 


gly 


arg 


arg arg 


arg 


glu ala 


arg 


his 


val 


ser 


ala 


121/41 


















151/51 


















GCC CAC 


CGC 


CAC 


CTC 


GGT 


CGC 


CGG 


CAG 


TAT 


GTC AGC 


ATG 


TGC 


AGA 


TGA 


CTC 


CAC 


GCA 


GCC 


ala his 


arg 


his 


leu 


gly 


arg 


arg 


gin 


tyr 


val ser 


met 


cys 


arg 


OPA 


leu 


his 


ala 


ala 


181/61 


















211/71 


















TTG TTC 


GCA 


TCG 


TTG 


GTG 


TCG 


TGG 


TTG 


CGA 


CGA CCT 


TGG 


CGC 


TGG 


TGA 


GCG 


CAC 


CCG 


CCG 


leu phe 


ala 


ser 


leu 


val 


ser 


trp 


leu 


arg 


arg pro 


trp 


arg 


trp 


OPA 


ala 


his 


pro 


pro 



241/81 

GCG GTC GTG CCG CGC ATG CGG ATC 
ala val val pro arg met arg ile 



SEQ ID No.l3B' 
FIGURE 13B f 



1/1 


















31/11 


















GTC CTT 


TGT 


GAC 


TGC 


CGG 


GCT 


TGA 


CGC 


GGA 


CGA CCA 


CAG 


AGT 


CGG 


TCA 


TCG 


CCT 


AAG 


GCT 


val leu 


cys 


asp 


cys 


arg 


ala 


OPA 


arg 


gly 


arg pro 


gin 


ser 


arg 


ser 


ser 


pro 


lys 


ala 


61/21 














91/31 
















CGG 


ACC GTT 


CTG 


ACC 


TGG 


GGC 


TGC 


GTG 


GGC 


GCC 


GAC GAC 


GTG 


AGG 


CAC 


GTC 


ATG 


TCT 


CAG 


thr val 


leu 


thr 


trp 


gly 


cys 


val 


gly 


ala 


asp asp 


val 


arg 


his 


val 


met 


ser 


gin 


arg 


121/41 


















151/51 














CAG 


CCT 


CCC ACC 


GCC 


ACC 


TCG 


GTC 


GCC 


GGC 


AGT 


ATG 


TCA GCA 


TGT 


GCA 


GAT 


GAC 


TCC 


ACG 


pro thr 


ala 


thr 


ser 


val 


ala 


gly 


ser 


met 


ser ala 


cys 


ala 


asp 


asp 


ser 


thr 


gin 


pro 


181/61 


















211/71 
















CGG 


TGT TCG 


CAT 


CGT 


TGG 


TGT 


CGT 


GGT 


TGC 


GAC 


GAC CTT 


GGC 


GCT 


GGT 


GAG 


CGC 


ACC 


CGC 


cys ser 


his 


arg 


trp 


cys 


arg 


gly 


cys 


asp 


asp leu 


gly 


ala 


gly 


glu 


arg 


thr 


arg 


arg 



241/81 

CGG TCG TGC CGC GCA TGC GGA TC 
arg ser cys arg ala cys gly 

SEQ ID No.l3C f 
FIGURE 13C 



REPLACEMENT SHEET (RULE 26) 



41/185 



sequence Rvl984c predicted by Cole et 

seql3A' 

1/1 

atg act cca cgc age ctt gtt cgc ate gtt 
Met thr pro arg ser leu val arg ile val 
61/21 

gtg age gca ccc gec ggc ggt cgt gec gcg 
val ser ala pro ala gly gly arg ala ala 
121/41 

gtt ttc get cgc ggc acg cat cag get tct 
val phe ala arg gly thr his gin ala ser 
181/61 

gac teg ctt ace teg caa gtt ggc ggg egg 
asp ser leu thr ser gin val gly gly arg 
241/81 

gca age gac gac tac cgc gcg age gcg tea 
ala ser asp asp tyr arg ala ser ala ser 
301/101 

cag cgc ace gtc gec age tgc ccg aac ace 
gin arg thr val ala ser cys pro asn thr 
361/121 

gcg acg gtc ate gat ttg tec acc teg gcg 
ala thr val ile asp leu ser thr ser ala 
421/141 

get gtc gee ctt ttc ggc gag cca tec agt 
ala val ala leu phe gly glu pro ser ser 
481/161 

teg ttg ccg aca ate ggt ccg ctg tat age 
ser leu pro thr ile gly pro leu tyr ser 
541/181 

gat cca ata tgc acc gga ggc ggc aat att 
asp pro ile cys thr gly gly gly asn ile 
601/201 

atg aca age cag gcg gcg aca ttc gcg gcg 
met thr ser gin ala ala thr phe ala ala 



al. (Nature 


393 


: 537 


-544) and containing 


■J1 /IT 
O 1/ 1 1 


















ggt gtc 


gtg 


gtt 


gcg 


acg 


acc 


ttg 


gcg 


ctg 


gly val 


\ra 1 

va j. 


val 


ala 




L.I1X. 


1 eu 


did 


leu 


Q 1 / *5 1 


















Cat gcg 




ccg 


tgt 


t eg 


gac 


Ct L. \— 


gcg 


gec 


I IS. a axa 


asp 


pro 


cys 


ser 


asp 


ile 


ala 


va 1 


ID Lf Ol 


















ggt ctt 


ggc 


gac 


gtc 


ggt 


gag 


gcg 


t- f- r* 

L- O 


gtc 


gly leu 


gly 


asp 


val 


gly 


glu 


ala 

ct J. a 


phe 


vp 1 




















ccg ate 


ggg 


gtc 


tac 


gcg 


gtg 


aac 


U ci 


cca 


ser ile 


gly 


val 


tyr 


ala 


val 




uyx 


pro 


OT1 /Q1 


















aac ggt 


tec 


gat 


gat 


gcg 


age 


gec 


cac 




-a o r*i fx 1 \t 

cioii y ±y 


ser 


asp 


asp 


ala 


ser 


ala 


his 


ile 


^ ^ 1 / I 1 1 

O O X / 111 


















agg att 


gtg 


ctt 


ggt 




tat 


teg 


cag 


OCT t 


arg ile 


veil 


leu 


gly 


y j_y 


t" \f K 


ser 


gin 




391/131 


















atg ccg 


CCC 


gcg 


gtg 


gca 


gat 


cat 


gtc 


gee 


met pro 


pro 


ala 


val 


ala 


asp 


his 


val 


ala 


451/151 


















ggt ttc 


tec 


age 


atg 


ttg 


tgg 


ggc 


ggc 


ggg 


gly phe 


ser 


ser 


met 


leu 


trp 


gly 


gly 


gly 


511/171 


















tct aag 


acc 


ata 


aac 


ttg 


tgt 


get 


ccc 


gac 


ser lys 


thr 


ile 


asn 


leu 


cys 


ala 


pro 


asp 


571/191 


















atg gcg 


cat 


gtt 


teg 


tat 


gtt 


cag 


teg 


ggg 


met ala 


his 


val 


ser 


tyr 


val 


gin 


ser 


gly 


631/211 


















aac agg 


etc 


gat 


cac 


gee 


gga 


tga 






asn arg 


leu 


asp 


his 


ala 


gly 


OPA 







SEQ ID No.l3D 
FIGURE 13D 



REPLACEMENT SHEET (RULE 26) 



42/185 



Seql3F: ORF predicted by Cole et al . (Nature 393:537-544) and containing Rvl984c 
1/1 31/11 

tga ggc acg tea tgt etc age ggc cca ccg cca cct egg teg ccg gca gta tgt cag cat 
OPA gly thr ser cys leu ser gly pro pro pro pro arg ser pro ala val cys gin his 
61/21 91/31 

gtg cag atg act cca cgc age ctt gtt cgc ate gtt ggt gtc gtg gtt gcg acg acc ttg 
val gin met thr pro arg ser leu val arg ile val gly val val val ala thr thr leu 
121/41 151/51 

gcg ctg gtg age gca ccc gec ggc ggt cgt gec gcg cat gcg gat ccg tgt teg gac ate 
ala leu val ser ala pro ala gly gly arg ala ala his ala asp pro cys ser asp ile 
181/61 211/71 

gcg gtc gtt ttc get cgc ggc acg cat cag get tct ggt ctt ggc gac gtc ggt gag gcg 
ala val val phe ala arg gly thr his gin ala ser gly leu gly asp val gly glu ■ ala 
241/81 271/91 

ttc gtc gac teg ctt acc teg caa gtt ggc ggg egg teg att ggg gtc tac gcg gtg aac 
phe val asp ser leu thr ser gin val gly gly arg ser ile gly val tyr ala val asn 
301/101 331/111 

tac cca gca age gac gac tac cgc gcg age gcg tea aac ggt tec gat gat gcg age gec 
tyr pro ala ser asp asp tyr arg ala ser ala ser asn gly ser asp asp ala ser ala 
361/121 391/131 - ■ 

cac ate cag cgc acc gtc gee age tgc ccg aac acc agg att gtg ctt ggt ggc tat teg 
his ile gin arg thr val ala ser cys pro asn thr arg ile val leu gly gly tyr ser 
421/141 451/151 

cag ggt gcg acg gtc ate gat ttg tec acc teg gcg atg ccg ccc gcg gtg gca gat cat 
gin gly ala thr val ile asp leu ser thr ser ala met pro pro ala val ala asp his 
481/161 511/171 

gtc gee get gtc gee ctt ttc ggc gag cca tec agt ggt ttc tec age atg ttg tgg ggc 
val ala ala val ala leu phe gly glu pro ser ser gly phe ser ser met leu trp gly 
541/181 571/191 

ggc ggg teg ttg ccg aca ate ggt ccg ctg tat age tct aag acc ata aac ttg tgt get 
gly gly ser leu pro thr ile gly pro leu tyr ser ser lys thr ile asn leu cys ala 
601/201 631/211 

ccc gac gat cca ata tgc acc gga ggc ggc aat att atg gcg cat gtt teg tat gtt cag 
pro asp asp pro ile cys thr gly gly gly asn ile met ala his val ser tyr val gin 
661/221 691/231 

teg ggg atg aca age cag gcg gcg aca ttc gcg gcg aac agg etc gat cac gee gga tga 
ser gly met thr ser gin ala ala thr phe ala ala asn arg leu asp his ala gly OPA 



SEQ ID No.l3F 
FIGURE 13F 



REPLACEMENT SHEET (RULE 26) 



43/185 



31/11 



CCA CCG 


GGG 


CTG 


GAG 


GGG 


CGA 


ATG 


TGC 


GCC 


GAA CGC 


CGT 


CGG 


CCA 


ACT 


TGG 


CCG 


CTG 


AGG 


pro pro 


gly 


leu 


glu 


gly 


arg 


met 


cys 


ala 


glu arg 


arg 


arg 


pro 


thr 


trp 


pro 


leu 


arg 


61/21 


















91/31 


















GCG GCT 


GAT 


CCC 


CTG 


GCC 


CGA 


GAC 


GGG 


GCA 


AGC CAA 


TAG 


CGG 


CTC 


CAT 


CGG 


GCT 


TTG 


CTG 


ala ala 


asp 


pro 


leu 


ala 


arg 


asp 


gly 


ala 


ser gin 


AMB 


arg 


leu 


his 


arg 


ala 


leu 


leu 


121/41 














151/51 


















GTA GCG 


GTT 


CGG 


CGG 


GAA 


CCG 


AGC 


GCC 


GAC 


GTT GTC 


GGT 


GCC 


CGG 


TGA 


TAT 


ATT 


GGG 


TCA 


val ala 


val 


arg 


arg 


glu 


pro 


ser 


ala 


asp 


val val 


gly 


ala 


arg 


OPA 


tyr 


ile 


gly 


ser 


181/61 














211/71 


















GAC GGG 


TAT 


GGC 


GGC 


GAC 


TGA 


GGT 


GAT 


CTG 


CGA CAC 


GCC 


GCC 


GCG 


GTG 


CTC 


GAG 


CCA 


GGC 


asp gly 


tyr 


gly 


gly 


asp 


OPA gly 


asp 


leu 


arg his 


ala 


ala 


ala 


val 


leu 


glu 


pro 


gly 


241/81 
















271/91 


















TTA CGA 


CCA 


GGG 


AAT 


TTC 


GAA 


AAT 


GTT 


ATT 


CAG AAC 


ATC 


TTG 


TAT 


CTC 


TTC 


CTC 


CGT 


GCC 


leu arg 


pro 


gly 


asn 


phe 


glu 


asn 


val 


ile 


gin asn 


ile 


leu 


tyr 


leu 


phe 


leu 


arg 


ala 


301/101 
















331/111 


















ACC CCC 


TAG 


GTG 


TAG 


TGT 


TTT 


CGA 


GTA 


CCG 


GCA GAT 


CCC 


AGT 


TCA 


CCA 


GTC 


TCA 


CCA 


GAT 


thr pro 


AMB 


val 


AMB 


cys 


phe 


arg 


val 


pro 


ala asp 


pro 


ser 


ser 


pro 


val 


ser 


pro 


asp 



C 

SEQ ID No.l4A 
FIGURE 14A 



CAC CGG GGC TGG AGG GGC GAA 
his arg gly trp arg gly glu 

62/21 

CGG CTG ATC CCC TGG CCC GAG 
arg leu ile pro trp pro glu 
122/41 

TAG CGG TTC GGC GGG AAC CGA 
AMB arg phe gly gly asn arg 
182/61 

ACG GGT ATG GCG GCG ACT GAG 
thr gly met ala ala thr glu 
242/81 

TAC GAC CAG GGA ATT TCG AAA 
tyr asp gin gly ile ser lys 

302/101 

CCC CCT AGG TGT AGT GTT TTC 
pro pro arg cys ser val phe 









32/11 




TGT 


GCG 


CCG 


AAC GCC 


GTC 


cys 


ala 


pro 


asn ala 


val 






92/31 




ACG 


GGG 


CAA 


GCC AAT 


AGC 


thr 


gly 


gin 


ala asn 


ser 








152/51 




GCG 


CCG 


ACG 


TTG TCG 


GTG 


ala 


pro 


thr 


leu ser 


val 








212/71 




GTG 


ATC 


TGC 


GAC ACG 


CCG 


val 


ile 


cys 


asp thr 


pro 








272/91 




ATG 


TTA 


TTC 


AGA ACA 


TCT 


met 


leu 


phe 


arg thr 


ser 








332/111 




GAG 


TAC 


CGG 


CAG ATC 


CCA 


glu 


tyr 


arg 


gin ile 


pro 



SEQ ID N0.14B 
FIGURE 14B 



GGC CAA CTT GGC CGC TGA GGG 
gly gin leu gly arg OPA gly 

GGC TCC ATC GGG CTT TGC TGG 
gly ser ile gly leu cys trp 

CCC GGT GAT ATA TTG GGT CAG 
pro gly asp ile leu gly gin 

CCG CGG TGC TCG AGC CAG GCT 
pro arg cys ser ser gin ala 

TGT ATC TCT TCC TCC GTG CCA 
cys ile ser ser ser val pro 

GTT CAC CAG TCT CAC CAG ATC 
val his gin ser his gin ile 



REPLACEMENT SHEET (RULE 26) 



44/185 



33/11 



ALL bob 


O J. 


GGA 


GGG 


GCG 


AAT 


GTG 


CGC 


CGA 


ACG CCG 


TCG 


GCC 


AAC 


TTG 


GCC 


GCT 


GAG 


GGC 


thr gly 


ala 


CI 1 V 

y y 




ala 


asn 


val 


arg 


arg 


thr pro 


ser 


ala 


asn 


leu 


ala 


ala 


glu 


gly 




















93/31 
















GGT 


bbL i b/\ 




CCT 


GGC 


CCG 


AGA 


CGG 


GGC 


AAG 


CCA ATA 


GCG 


GCT 


CCA 


TCG 


GGC 


TTT 


GCT 


_T ADA 

g _L y yJ ir-rt. 


ser 


p r o 




pro 


ara 


arcr 


crl v 


lys 


pro ile 


ala 


ala 


pro 


ser 


gly 


phe 


ala 


gly 


123/41 
















153/51 
















AGA 


AGC GGT 


TCG 


GCG 


GGA 


ACC 


GAG 


CGC 


CGA 


CGT 


TGT CGG 


TGC 


CCG 


GTG 


ATA 


TAT 


TGG 


GTC 


ser gly 


ser 


ala 


gly 


thr 


glu 


arg 


arg 


arg 


cys arg 


cys 


pro 


val 


ile 


tyr 


trp 


val 


arg 


183/61 


















213/71 
















CTT 


CGG GTA 


TGG 


CGG 


CGA 


CTG 


AGG 


TGA 


TCT 


GCG 


ACA CGC 


CGC 


CGC 


GGT 


GCT 


CGA 


GCC 


AGG 


arg val 


trp 


arg 


arg 


leu 


arg 


OPA 


ser 


ala 


thr arg 


arg 


arg 


gly 


ala 


arg 


ala 


arg 


leu 


243/81 














273/91 
















CAC 


ACG ACC 


AGG 


GAA 


TTT 


CGA 


AAA 


TGT 


TAT 


TCA 


GAA CAT 


CTT 


GTA 


TCT 


CTT 


CCT 


CCG 


TGC 


thr thr 


arg 


glu 


phe 


arg 


lys 


cys 


tyr 


ser 


glu his 


leu 


val 


ser 


leu 


pro 


pro 


cys 


his 


303/101 














333/111 
















TC 


CCC CTA 


GGT 


GTA 


GTG 


TTT 


TCG 


AGT 


ACC 


GGC 


AGA TCC 


CAG 


TTC 


ACC 


AGT 


CTC 


ACC 


AGA 


pro leu 


gly 


val 


val 


phe 


ser 


ser 


thr 


gly 


arg ser 


gin 


phe 


thr 


ser 


leu 


thr 


arg 





SEQ ID No.l4C 
FIGURE 14C 



part of the nucleotide sequence of seq!4A 
1/1 31/11 

TTT TCG AGT ACC GGC AGA TCC CAG GTT CAC CAG GTC TCA CCA GAT C 
phe ser ser thr gly arg ser gin val his gin val ser pro asp 

SEQ ID NO-14A' 
FIGURE 14A f 



1/1 

TGT TTT CGA GTA CCG GCA GAT CCC 
cys phe arg val pro ala asp pro 



31/11 

AGG TTC ACC AGG TCT CAC CAG ATC 
arg phe thr arg ser his gin ile 



SEQ ID No.l4C 
FIGURE 14C 



1/1 

GTT TTC GAG TAC CGG CAG ATC CCA 
val phe glu tyr arg gin ile pro 



31/11 

GGT TCA CCA GGT CTC ACC AGA TC 
gly ser pro gly leu thr arg 

SEQ ID N0.14C 
FIGURE 14C 



REPLACEMENT SHEET (RULE 26) 



45/185 



ORF predicted based on the 
544) and containing seql4A' 

1/1 



TAG CGG 


TTC 


GGC 


GGG 


AAG 


CTA 


AMB arg 


phe 


gly 


gly 


lys 


leu 


61/21 












ACG GGT 


ATG 


GCG 


GCG 


GCT 


GAG 


thr gly 


met 


ala 


ala 


ala 


glu 


121/41 












TAC GAC 


CAG 


GGA 


ATT 


TCG 


AAA 


tyr asp 


gin 


gly 


ile 


ser 


lys 


181/61 












CCC CTA 


GGT 


GTA 


GTG 


TTT 


TCG 


pro leu 


gly 


val 


val 


phe 


ser 


241/81 












cca egg 


ggc 


gcg 


atg 


aac 


ttc 


pro arg 


gly 


ala 


met 


asn 


phe 


301/101 












tat gac 


ggg 


aat 


ctg 


gag 


cct 


tyr asp 


gly 


asn 


leu 


glu 


pro 


361/121 












teg ttg 


cca 


gat 


cct 


gtc 


aga 


ser leu 


pro 


asp 


pro 


val 


arg 


421/141 












gac cgt 


tta 


cac 


taa 






asp arg 


leu 


his 


OCH 







sequence published by 









31/11 




GCG 


GCG 


ACG 


TTG TCG 


GTG 


ala 


ala 


thr 


leu ser 


val 








91/31 




GTG 


ATC 


TGC 


GAC ACG 


CCG 


val 


ile 


cys 


asp thr 


pro 








151/51 




ATG 


TTA 


TTC 


AGA ACA 


TCT 


met 


leu 


phe 


arg thr 


ser 








211/71 




AGT 


ACC 


GGC 


AGA TCC 


CAG 


ser 


thr 


gly 


arg ser 


gin 








271/91 




ccg 


gca 


teg 


gca teg 


cca 


pro 


ala 


ser 


ala ser 


pro 








331/111 




tgt 


egg 


gec 


get caa 


cat 


cys 


arg 


ala 


ala gin 


his 








391/131 




ttc 


ccg 


att 


tec gca 


aag 


phe 


pro 


ile 


ser ala 


lys 



SEQ ID No.l4F 
FIGURE 14F 



e et al. (Nature 393:537- 



GCC GGT GAT ATA TTG GGT CAG 
ala gly asp ile leu gly gin 

CCG CGG TGC TCG AGC CAG GCT 
pro arg cys ser ser gin ala 

TGT ATC TCT TCT CCG TGC CAC 
cys ile ser ser pro cys his 

GTT CAC CAG GTC TCA CCA gat 
val his gin val ser pro asp 

ggt cga egg acg tgg teg cgc 
gly arg arg thr trp ser arg 

ate gaa gat gca eta ctt gag 
ile glu asp ala leu leu glu 

gag egg tac gec cat gac cgt 
glu arg tyr ala his asp arg 



REPLACEMENT SHEET (RULE 26) 



46/185 



Sequence Rv3054c predicted by Cole et al. (Nature 393:537-544) 
which may be in phase with Seql4A' 

1/1 

gtg tea gat acc aag tec gac ate aaa ate ttg gcc tta gtg gga age ctg cgc gcg gcg 
val ser asp thr lys ser asp ile lys ile leu ala leu val gly ser leu arg ala ala 
61/21 

teg ttc aac cgc cag ate gcc gag ctg get gcc aag gtc get ccg gac ggc gtc acc gtc 
ser phe asn arg gin ile ala glu leu ala ala lys val ala pro asp gly val thr val 
121/41 151/51 

acc atg ttc gag ggg ctg ggg gac ctg ccg ttc tac aac gaa gac ate gac aca gcg acg 
thr met phe glu gly leu gly asp leu pro phe tyr asn glu asp ile asp thr ala thr 
181/61 211/71 

gag gtg ccg gcg ccg gtg age gcg ttg egg gag gcc gcg tct gac gcg cac get gcc ttg 
glu val pro ala pro val ser ala leu arg glu ala ala ser asp ala his ala ala leu 
241/81 271/91 

gtg gtc acg ccg gaa tac aac ggc age att ccg gcc gtg ate aag aac gcg ate gac tgg 
val val thr pro glu tyr asn gly ser ile pro ala val ile lys asn ala ile asp trp 
301/101 331/111 

ctg tec agg cca ttc ggc gat ggc gcg ttg aag gac aag ccg ttg gcc gtg ate ggc ggc 
leu ser arg pro phe gly asp gly ala leu lys asp lys pro leu ala val xle gly gly 
361/121 391/131 

tec atg ggc cgc tac ggc ggg gta tgg gcg cac gac gag act cgc aag teg ttc age ate 
ser met gly arg tyr gly gly val trp ala his asp glu thr arg lys ser phe ser lie 
421/141 451/151 

get ggc acg egg gtg gtc gat gcg ate aaa ctg teg gtg ccg ttc caa act ctg ggc aag 
ala gly thr arg val val asp ala ile lys leu ser val pro phe gin thr leu gly lys 
481/161 511/171 

teg gtc gcg gac gac gcc ggg ctg gcg gcg aat gtg cgc gac gcc gtc ggc aac ttg gcc 
ser val ala asp asp ala gly leu ala ala asn val arg asp ala val gly asn leu ala 
541/181 

get gag gtc ggc tga 
ala glu val gly OPA 

SEQ ID No.l4R 
FIGURE 14R 



REPLACEMENT SHEET (RULE 26) 



47/185 



ORF predicted by Cole et al . (Nature 393:537-544) and containing Rv3054c 
1/1 31/11 

taa cqc gat egg aat aaa teg gac cat ggt ccg gtt ggc teg tgc aag gac gtg gac caa 
OCH arg asp arg asn lys ser asp his gly pro val gly ser cys lys asp val asp gin 

9 1 / 3 X 

caa gcg gaa agg aac gta gca gtg tea gat ace aag tec gac ate aaa ate ttg gee tta 
aln ala glu arg asn val ala val ser asp thr lys ser asp ile lys lie leu ala leu 

151/51 

gtg gga age ctg cgc gcg gcg teg ttc aac cgc cag ate gec gag ctg get gee aag gtc 
val gly ser leu arg ala ala ser phe asn arg gin ile ala glu leu ala ala lys val 
181/61 211/71 

act ccg gac ggc gtc acc gtc acc atg ttc gag ggg ctg ggg gac ctg ccg ttc tac aac 
ala pro asp gly val thr val thr met phe |1« |JV leu glv as P leu pro phe tyr aSn 

gaa'gac ate gac aca gcg acg gag gtg ccg gcg ccg gtg age gcg ttg egg gag gee gcg 
glu asp ile asp thr ala thr glu val pro ^la^pro val ser ala leu arg glu ala ala 

tct/gac gcg cac get gee ttg gtg gtc acg ccg gaa tac aac ggc age att ccg gee gtg 
ser asp ala his ala ala leu val val thr pro glu tyr asn gly ser lie pro ala val 
361/121 391/131 

ate aag aac gcg ate gac tgg ctg tec agg cca ttc ggc gat ggc gcg ttg aag gac aag 
ile lys asn ala ile asp trp leu ser arg pro phe gly asp gly ala leu lys asp lys 
421/141 451/151 

ccg ttg gee gtg ate ggc ggc tec atg ggc cgc tac ggc ggg gta tgg gcg cac gac gag 
pro leu ala val ile gly gly ser met gly arg tyr gly gly val trp ala his asp glu 
481/161 511/171 

act cgc aag teg ttc age ate get ggc acg egg gtg gtc gat gcg ate aaa ctg teg gtg 
thr arg lys ser phe ser ile ala gly thr arg val val asp ala ile lys leu ser val 

181 571/191 
ccg ttc caa act ctg ggc aag teg gtc gcg gac gac gec ggg ctg gcg gcg aat gtg cgc 
pro phe gin thr leu gly lys ser val ala asp asp ala gly leu ala ala asn val arg 
601/201 631/211 
gac gec gtc ggc aac ttg gec get gag gtc ggc tga 
asp ala val gly asn leu ala ala glu val gly OPA 



SEQ ID No.l4P 



FIGURE 14P 



REPLACEMENT SHEET (RULE 26) 



48/185 



fragment based on the sequence published by Cole et al . (Nature 393:537-54 4) and 
containing seq 14F' and seq 14P" 

1/1 31/11 

taa cgc gat egg aat aaa teg gac cat ggt ccg gtt ggc teg tgc aag gac gtg gac caa 
OCH arg asp arg asn lys ser asp his gly pro val gly ser cys lys asp val asp gin 
asn ala ile gly ile asn arg thr met val arg leu ala arg ala arg thr trp thr asn 
thr arg ser glu OCH ile gly pro trp ser gly trp leu val gin gly arg gly pro thr 
61/21 91/31 

caa gcg gaa agg aac gta gca gtg tea gat acc aag tec gac ate aaa ate ttg gec tta 
gin ala glu arg asn val ala val ser asp thr lys ser asp lie lys He leu ala leu 
lys arg lys gly thr AMB gin cys gin ile pro ser pro thr ser lys ser trp pro AMB 
ser gly lys glu arg ser ser val arg tyr gin val arg his gin asn leu gly leu ser 
121/41 151/51 

gtg gga age ctg cgc gcg gcg teg ttc aac cgc cag ate gee gag ctg get gec aag gtc 
val gly ser leu arg ala ala ser phe asn arg gin ile ala glu leu ala ala lys val 
trp glu ala cys ala arg arg arg ser thr ala arg ser pro ser trp leu pro arg ser 
aly lys pro ala arg gly val val gin pro pro asp arg arg ala gl^ cys gin gly arg 
181/61 211/71 

act ccg gac ggc gtc acc gtc acc atg ttc gag ggg ctg ggg gac ctg ccg ttc tac aac 
ala pro asp gly val thr val thr met phe glu gly leu gly asp leu pro phe tyr asn 
leu arg thr ala ser pro ser pro cys ser arg gly trp gly thr cys arg ser thr thr 
ser gly arg arg his arg his his val arg gly ala gly gly pro ala val leu gin arg 
241/81 271/91 

gaa gac ate gac aca gcg acg gag gtg ccg gcg ccg gtg age gcg ttg egg gag gec gcg 
qlu asp ile asp thr ala thr glu val pro ala pro val ser ala leu arg glu ala ala 
lys thr ser thr gin arg arg arg cys arg arg arg OPA ala arg cys gly arg pro arg 
arg his arg his ser asp gly gly ala gly ala gly glu arg val ala gly gly arg val 
301/101 331/111 

tct gac gcg cac get gee ttg gtg gtc acg ccg gaa tac aac ggc age att ccg gec gtg 
ser asp ala his ala ala leu val val thr pro glu tyr asn gly ser ile pro ala val 
leu thr arg thr leu pro trp trp ser arg arg asn thr thr ala ala phe arg pro OPA 
OPA arg ala arg cys leu gly gly his ala gly ile gin arg gin his ser gly arg asp 
361/121 391/131 

ate aag aac gcg ate gac tgg ctg tec agg cca ttc ggc gat ggc gcg ttg aag gac aag 
ile lys asn ala ile asp trp leu ser arg pro phe gly asp gly ala leu lys asp lys 
ser arg thr arg ser thr gly cys pro gly his ser ala met ala arg OPA arg thr ser 
gin glu arg asp arg leu ala val gin ala ile arg arg trp arg val glu gly gin ala 
421/141 451/151 

ccg ttg gec gtg ate ggc ggc tec atg ggc cgc tac ggc ggg gta tgg gcg cac gac gag 
pro leu ala val ile gly gly ser met gly arg tyr gly gly val trp ala his asp glu 
arq trp pro OPA ser ala ala pro trp ala ala thr ala gly tyr gly arg thr thr arg 
val gly arg asp arg arg leu his gly pro leu arg arg gly met gly ala arg arg asp 
481/161 511/171 

act cgc aag teg ttc age ate get ggc acg egg gtg gtc gat gcg ate aaa ctg teg gtg 
thr arg lys ser phe ser ile ala gly thr arg val val asp ala ile lys leu ser val 
leu ala ser arg ser ala ser leu ala arg gly trp ser met arg ser asn cys arg cys 
ser gin val val gin his arg trp his ala gly gly arg cys asp gin thr val gly ala 
541/181 571/191 

ccg ttc caa act ctg ggc aag teg gtc gcg gac gac gec ggg ctg gcg gcg aat gtg cgc 
pro phe gin thr leu gly lys ser val ala asp asp ala gly leu ala ala asn val arg 
arq ser lys leu trp ala ser arg ser arg thr thr pro gly trp arg arg met cys ala 
val pro asn ser gly gin val gly arg gly arg arg arg ala gly gly glu cys ala arg 

601/201 631/211 

gac gee gtc ggc aac ttg gec get gag gtc ggc tga tec ctg ggc cga ggc ggg tea gec 
asp ala val gly asn leu ala ala glu val gly OPA ser leu gly arg gly gly ser ala 
thr pro ser ala thr trp pro leu arg ser ala asp pro trp ala glu ala gly gin pro 
arg arg arg gin leu gly arg OPA gly arg leu ile pro gly pro arg arg val ser gin 
661/221 691/231 

aat age ggc tec ate ggc ttt get ggt age ggt teg gcg gga age tag egg cga cgt tgt 
asn ser gly ser ile gly phe ala gly ser gly ser ala gly ser AMB arg arg arg cys 
ile ala ala pro ser ala leu leu val ala val arg arg glu ala ser gly asp val val 
AMB arg leu his arg leu cys trp AMB arg phe gly gly lys leu ala ala thr leu ser 



SEQ ID No.l4Q 



FIGURE 14Q 
REPLACEMENT SHEET (RULE 26) 



49/185 

721/241 751/251 

egg tgg ccg gtg ata tat tgg gtc aga egg gta tgg egg egg ctg agg tga tct geg aca 
arg trp pro val ile tyr trp val arg arg val trp arg arg leu arg OPA ser ala thr 
alv gly arg OPA tyr ile gly ser asp gly tyr gly gly gly OPA gly asp leu arg his 
val ala gly asp ile leu gly gin thr gly met ala ala ala glu val lie cys asp thr 
781/261 811/271 

cac cgc cgc ggt get cga gec agg ctt acg acc agg gaa ttt cga aaa tgt tat tea gaa 
arg arg arg gly ala arg ala arg leu thr thr arg glu phe arg lys cys tyr ser glu 
ala ala ala val leu glu pro gly leu arg pro gly asn phe glu asn val ile gin asn 
pro pro arg cys ser ser gin ala tyr asp gin gly ile ser lys met leu phe arg thr 
841/281 871/291 

cat ctt qta tct ctt etc cgt gee acc ccc tag gtg tag tgt ttt cga gta ccg gca gat 
his leu val ser leu leu arg ala thr pro AMB val AMB cys phe arg val pro ala asp 
ile leu tyr leu phe ser val pro pro pro arg cys ser val phe glu tyr arg gin lie 
ser cys ile ser ser pro cys his pro leu gly val val phe ser ser thr gly arg ser 
901/301 931/311 

ccc agg ttc acc agg tct cac cag ate cac ggg geg cga tga act tec egg cat egg cat 
pro arg phe thr arg ser his gin ile his gly ala arg OPA thr ser arg his arg his 
pro gly ser pro gly leu thr arg ser thr gly arg asp glu leu pro gly ile gly lie 
gin val his gin val ser pro asp pro arg gly ala met asn phe pro ala ser ala ser 
961/321 991/331 

cgc cag gtc gac gga cgt ggt cgc get atg acg gga ate tgg age ctt gtc ggg ccg etc 
arg gin val asp gly arg gly arg ala met thr gly ile trp ser leu val gly pro leu 
ala arg ser thr asp val val ala leu OPA arg glu ser gly ala leu ser gly arg ser 
pro gly arg arg thr trp ser arg tyr asp gly asn leu glu pro cys arg ala ala gin 
1021/341 1051/351 

aac ata teg aag atg cac tac ttg agt cgt tgc cag ate ctg tea gat tec cga ttt ccg 
asn ile ser lys met his tyr leu ser arg cys gin ile leu ser asp ser arg phe pro 
thr tyr arg arg cys thr thr OPA val val ala arg ser cys gin lie pro asp phe arg 
his ile glu asp ala leu leu glu ser leu pro asp pro val arg phe pro lie ser ala 
1081/361 1111/371 
caa agg age ggt acg ccc atg acc gtg acc gtt tac act aa 
gin arg ser gly thr pro met thr val thr val tyr thr 
lys gly ala val arg pro OPA pro OPA pro phe thr leu 
lys glu arg tyr ala his asp arg asp arg leu his OCH 

SEQ ID No. 14Q (continued) 
FIGURE 14Q (continued) 



l/l 

CAA GCC 
gin ala 
61/21 
GCG CAC 
ala his 
121/41 
CGA TGC 
arg cys 
181/61 
GGC gg; 
gly gl! 
241/81 













31/11 


















CTG 


TTT 


GCC 


GTT 


TTG 


GGG CTC 


CTA 


CCA 


GAA 


CAC 


CAC 


CTG 


GCG 


GCC 


leu 


phe 


ala 


val 


leu 


gly leu 


leu 


pro 


glu 


his 


his 


leu 


ala 


ala 










91/31 
















CGT 


CAC 


CAG 


TTG 


CGA 


TCG 


GTT CCT 


CCC 


GCG 


CGC 


GGG 


CGG 


CGA 


CGA 


his 


gin 


leu 


arg 


ser 


val pro 


pro 


ala 


arg 


gly 


arg 


arg 


arg 


arg 










151/51 


















GGC 


GGC 


GCA 


GCT 


GCG 


TAG CTC 


GAC 


CCG 


GTC 


GAC 


GAC 


GAC 


GGG 


GTC 
val 


gly 


gly 


ala 


ala 


ala 


AMB leu 


asp 


pro 


val 


asp 


asp 


asp 


gly 








211/71 
















CAC 
his 


GAT 


GTC 


GAG 


GCG 


ATG 


GCA ATA 


CAG 


CGC 


CTT 


GGT 


GCG 


CGG 


CCA 


asp 


val 


glu 


ala 


met 


ala ile 


gin 


arg 


leu 


gly 


ala 


arg 


pro 








271/91 
















TAC 


GAC 


CAG 


TCC 


CGC 


GCC 


CAC CGG 


CAG 


CCG 


GAT 


CCG 


GAT 


ACG 


CGG 


asp 


gin 


ser 


arg 


ala 


his arg 


gin 


pro 


asp 


pro 


asp 


thr 


arg 


tyr 



SEQ ID No.lSA 

FIGURE 15A 
REPLACEMENT SHEET (RULE 26) 



50/185 





















32/11 


















AAG CCC 


GGC 


CGC 


GAC 


TGT 


TTG 


CCG 


TTT 


TGG 


GGC TCC 


TAC 


CAG 


AAC 


ACC 


ACC 


TGG 


CGG 


CCG 


lys pro 


gly 


arg 


asp 


cys 


leu 


pro 


phe 


trp 


gly ser 


tyr 


gin 


asn 


thr 


thr 


trp 


arg 


pro 


62/21 


















92/31 














GAC 


GTC 


CGC ACC 


ATG 


GTG 


TGC 


ACC 


AGT 


TGC 


GAT 


CGG 


TTC CTC 


CCG 


CGC 


GCG 


GGC 


GGC 


GAC 


arg thr 


met 


val 


cys 


thr 


ser 


cys 


asp 


arg 


phe leu 


pro 


arg 


ala 


gly 


gly 


asp 


asp 


val 


122/41 


















152/51 
















TCG 


GAT GCC 


CGC 


GCC 


CCG 


GCG 


GCG 


CAG 


CTG 


CGT 


AGC TCG 


ACC 


CGG 


TCG 


ACG 


ACG 


ACG 


GGG 


asp ala 


arg 


ala 


pro 


ala 


ala 


gin 


leu 


arg 


ser ser 


thr 


arg 


ser 


thr 


thr 


thr 


gly 


ser 


182/61 














212/71 
















ACG 


GCG GAC 


CAG 


TCG 


GCG 


ATG 


TCG 


AGG 


CGA 


TGG 


CAA TAC 


AGC 


GCC 


TTG 


GTG 


CGC 


GGC 


CAC 


ala asp 


gin 


ser 


ala 


met 


ser 


arg 


arg 


trp 


gin tyr 


ser 


ala 


leu 


val 


arg 


gly 


his 


thr 


242/81 
















272/91 


















TCT GAG 


GTG 


GCG 


AAG 


ACC 


AGT 


CCC 


GCG 


CCC 


ACC GGC 


AGC 


CGG 


ATC 


CGG 


ATA 


CGC 


GGT 


AC 


ser glu 


val 


ala 


lys 


thr 


ser 


pro 


ala 


pro 


thr gly 


ser 


arg 


ile 


arg 


ile 


arg 


gly 





SEQ ID No.lSB 
FIGURE 15B 



33/11 



AGC CCG 


GCC 


GCG 


ACT 


GTT 


TGC 


CGT 


TTT 


GGG 


GCT CCT 


ACC 


AGA 


ACA 


CCA 


CCT 


GGC 


GGC 


CGC 


ser pro 


ala 


ala 


thr 


val 


cys 


arg 


phe 


gly 


ala pro 


thr 


arg 


thr 


pro 


pro 


gly 


gly 


arg 


63/21 


















93/31 














ACG 


TCG 


GCA CCA 


TGG 


TGT 


GCA 


CCA 


GTT 


GCG 


ATC 


GGT 


TCC TCC 


CGC 


GCG 


CGG 


GCG 


GCG 


ACG 


ala pro 


trp 


cys 


ala 


pro 


val 


ala 


ile 


giy 


ser ser 


arg 


ala 


arg 


ala 


ala 


thr 


thr 


ser 


123/41 














153/51 












CGG 


GGT 


CGG 


ATG CCC 


GCG 


CCC 


CGG 


CGG 


CGC 


AGC 


TGC 


GTA 


GCT CGA 


CCC 


GGT 


CGA 


CGA 


CGA 


met pro 


ala 


pro 


arg 


arg 


arg 


ser 


cys 


val 


ala arg 


pro 


gly 


arg 


arg 


arg 


arg 


gly 


arg 


183/61 
















213/71 














ACA 


CGT 


CGG ACC 


AGT 


CGG 


CGA 


TGT 


CGA 


GGC 


GAT 


GGC 


AAT ACA 


GCG 


CCT 


TGG 


TGC 


GCG 


GCC 


arg thr 


ser 


arg 


arg 


cys 


arg 


gly 


asp 


gly 


asn thr 


ala 


pro 


trp 


cys 


ala 


ala 


thr 


arg 


243/81 
















273/91 












GCG 


GTA 




CTG AGG 


TGG 


CGA 


AGA 


CCA 


GTC 


CCG 


CGC 


CCA 


CCG GCA 


GCC 


GGA 


TCC 


GGA 


TAC 


C 


leu arg 


trp 


arg 


arg 


pro 


val 


pro 


arg 


pro 


pro ala 


ala 


gly 


ser 


gly 


tyr 


ala 


val 





SEQ ID No.ISC 
FIGURE 15C 



REPLACEMENT SHEET (RULE 26) 



51/185 



part of the nucleotide sequence of seq!5A 



1/1 


















31/11 


















GGC GGC 


CGC 


GCG 


CCA 


TGG 


TGT 


GCA 


CCA 


GTT 


GCG ATC 


GGT 


TCT 


CCC 


GCG 


CGC 


GGG 


CGG 


CGA 


gly gly 


arg 


ala 


pro 


trp 


cys 


ala 


pro 


val 


ala ile 


gly 


ser 


pro 


ala 


arg 


gly 


arg 


arg 


61/21 
















91/31 


















CGA CGT 


CGA 


TGG 


CCG 


CGC 


CCC 


GGC 


GGC 


TGC 


AGC TGC 


GTA 


GCT 


CGA 


CCC 


GGT 


CGA 


CGA 


CGA 


arg arg 


arg 


trp 


pro 


arg 


pro 


gly 


gly 


cys 


ser cys 


val 


ala 


arg 


pro 


gly 


arg 


arg 


arg 


121/41 


















151/51 


















CGG GGT 


CGG 


CGG 


GCC 


AGT 


CGG 


CGA 


TGT 


CGA 


GGC GAT 


GGC 


AAT 


ACA 


GCG 


CCT 


TGG 


TGC 


GCG 


arg gly 


arg 


arg 


ala 


ser 


arg 


arg 


cys 


arg 


gly asp 


gly 


asn 


thr 


ala 


pro 


trp 


cys 


ala 


181/61 
















211/71 


















GCC ACA 


CGT 


CTG 


AGG 


TGG 


CGA 


AGA 


CCA 


GTC 


CCG CGC 


CCA 


CCG 


GCA 


GCC 


GGA 


TC 






ala thr 


arg 


leu 


arg 


trp 


arg 


arg 


pro 


val 


pro arg 


pro 


pro 


ala 


ala 


gly 









SEQ ID No. ISA f 
FIGURE 15A f 



1/1 

GCG GCC GCG CGC 
ala ala ala arg 
61/21 

GAC GTC GAT GGC 
asp val asp gly 
121/41 

GGG GTC GGC GGG 
gly val gly gly 
181/61 

CCA CAC GTC TGA 
pro his val OPA 



CAT GGT GTG CAC 
his gly val his 

CGC GCC CCG GCG 
arg ala pro ala 

CCA GTC GGC GAT 
pro val gly asp 

GGT GGC GAA GAC 
gly gly glu asp 







31/11 


CAG 


TTG 


CGA TCG 


gin 


leu 


arg ser 






91/31 


GCT 


GCA 


GCT GCG 


ala 


ala 


ala ala 






151/51 


GTC 


GAG 


GCG ATG 


val 


glu 


ala met 






211/71 


CAG 


TCC 


CGC GCC 


gin 


ser 


arg ala 



SEQ ID No.lSB 
FIGURE 15B' 



GTT CTC CCG CGC 

val leu pro arg 

TAG CTC GAC CCG 

AMB leu asp pro 

GCA ATA CAG CGC 

ala ile gin arg 

CAC CGG CAG CCG 

his arg gin pro 



GCG GGC GGC GAC 
ala gly gly asp 

GTC GAC GAC GAC 
val asp asp asp 

CTT GGT GCG CGG 
leu gly ala arg 

GAT C 
asp 



1/1 

TGG CGG CCG CGC GCC ATG GTG 
trp arg pro arg ala met val 
61/21 

ACG ACG TCG ATG GCC GCG CCC 
thr thr ser met ala ala pro 
121/41 

ACG GGG TCG GCG GGC CAG TCG 
thr gly ser ala gly gin ser 
181/61 

GGC CAC ACG TCT GAG GTG GCG 
gly his thr ser glu val ala 



31/11 

TGC ACC AGT TGC GAT CGG TTC 
cys thr ser cys asp arg phe 
91/31 

CGG CGG CTG CAG CTG CGT AGC 
arg arg leu gin leu arg ser 
151/51 

GCG ATG TCG AGG CGA TGG CAA 
ala met ser arg arg trp gin 

211/71 

AAG ACC AGT CCC GCG CCC ACC 
lys thr ser pro ala pro thr 



TCC CGC GCG CGG GCG GCG 
ser arg ala arg ala ala 

TCG ACC CGG TCG ACG ACG 
ser thr arg ser thr thr 

TAC AGC GCC TTG GTG CGC 
tyr ser ala leu val arg 

GGC AGC CGG ATC 
gly ser arg iie 



SEQ ID No.l5C f 

FIGURE 15C* 
REPLACEMENT SHEET (RULE 26) 



52/185 



ORF containing Seql5A' according to Cole et al . (Nature 393:537-544) 
1/1 31/11 

taa ggt ccg cca acg ctt tac get cga egg ccg cca cga gtt ggc egg cca ctt tea ggc 
OCH gly pro pro thr leu tyr ala arg arg pro pro arg val gly arg pro leu ser gly 
61/21 91/31 

cgt agt cgc cgc agg gca ggg ctt ccc gcg teg tct teg egg gtt tgt egg caa agg tgt 
arg ser arg arg arg ala gly leu pro ala ser ser ser arg val cys arg gin arg cys 
121/41 151/51 

agg ggt age gtt cgt ggg cgt cga cga cga tgt gca get egg gga tgc egg egg cgc ggg 
arg gly ser val arg gly arg arg arg arg cys ala ala arg gly cys arg arg arg gly 
181/61 211/71 

egg tgg ggg tgc gca cgc ccg gee gcg act gtt tgc gcg ttt tgg ggc tct gec aga aca 
arg trp gly cys ala arg pro ala ala thr val cys ala phe trp gly ser ala arg thr 
241/81 271/91 

cca cct ggc ggc cgc gcg cca tgg tgt gca cca gtt gcg ate ggt tct ccc gcg cgc ggg 
pro pro gly gly arg ala pro trp cys ala pro val ala ile gly ser pro ala arg gly 
301/101 331/111 

egg cga cga cgt cga tgg ccg cgc ccc ggc ggc tgc age tgc gta get cga ccc ggt cga 
arq arg arg arg arg trp pro arg pro gly gly cys ser cys val ala arg pro gly arg 
361/121 391/131 

cga cga egg ggt egg egg gee agt egg cga tgt cga ggc gat ggc aat aca gcg cct tgg 
arg arg arg gly arg arg ala ser arg arg cys arg gly asp gly asn thr ala pro trp 
421/141 451/151 

tgc gcg gec aca cgt ctg agg tgg cga aga cca gtc ccg cgc cca ccg gca gec gga tea 
cys ala ala thr arg leu arg trp arg arg pro val pro arg pro pro ala ala gly ser 
481/161 511/171 

ggt agg gca ggc gcg agt ctt cag egg ggt tgg egg cga cga gca get cca cag agt gtg 
qly arg ala gly ala ser leu gin arg gly trp arg arg arg ala ala pro gin ser val 
541/181 571/191 

agg gta egg gcg gcg tac ggc aac ggt gaa gca ggc act ccg acg aac cca teg tea cgt 
arg val arg ala ala tyr gly asn gly glu ala gly thr pro thr asn pro ser ser arg 
601/201 

cga agg ggc agg tga 
arg arg gly arg OPA 



SEQ ID N0.15F 
FIGURE 15F 



REPLACEMENT SHEET (RULE 26) 



53/185 



R:Rv2530c predicted according to Cole et al . (Nature 393:537-544) and which may 
be in phase with SEQ15A 

1/1 31/11 



gtQ acq 


gca 


ctg 


etc 


gat 


gtc 


aat 


gtg 


ctg 


ate gcg 


ctg 


ggc 


tgg 


ccg 


aat 


cac 


gtt 


cac 


val thr 


ala 


leu 


leu 


asp 


val 


asn 


val 


leu 


ile ala 


leu 


gly 


trp 


pro 


asn 


his 


val 


his 


61/21 


















91/31 


















cat gcg 


gec 


gcg 


cag 


cga 


tgg 


ttc 


acg 


cag 


ttc tec 


teg 


aat 


ggg 


tgg 


gee 


acc 


acg 


ccg 


his ala 


ala 


ala 


gin 


arg 


trp 


phe 


thr 


gin 


phe ser 


ser 


asn 


gly 


trp 


ala 


thr 


thr 


pro 


121/41 


















151/51 


















ate acc 


gag 


gca 


ggg 


tat 


gtc 


cga 


att 


tea 


age aat 


cgc 


agt 


gtg 


atg 


cag 


gtg 


teg 


acc 


ile thr 


glu 


ala 


gly 


tyr 


val 


arg 


ile 


ser 


ser asn 


arg 


ser 


val 


met 


gin 


val 


ser 


thr 


181/61 
















211/71 


















acg ccg 


get 


ate 


gcg 


ate 


get 


cag 


ttg 


gcg 


gcg atg 


act 


tct 


ctt 


gee 


ggg 


cac 


acg 


ttt 


thr pro 


ala 


ile 


ala 


ile 


ala 


gin 


leu 


ala 


ala met 


thr 


ser 


leu 


ala 


gly 


his 


thr 


phe 


241/81 


















271/91 


















tgg cct 


gac 


gat 


gtg 


cca 


ctg 


ate 


gtt 


ggg 


age gec 


ggc 


gat 


cgc 


gat 


gcg 


gtg 


tec 


aac 


trp pro 


asp 


asp 


val 


pro 


leu 


ile 


val 


gly 


ser ala 


gly 


asp 


arg 


asp 


ala 


val 


ser 


asn 


301/101 














331/111 


















cac cgt 


egg 


gtc 


acc 


gac 


tgc 


cat 


etc 


ate 


gec ttg 


gee 


gcg 


cgc 


tac 


ggg 


ggc 


egg 


ttg 


his arg 


arg 


val 


thr 


asp 


cys 


his 


leu 


ile 


ala leu 


ala 


ala 


arg 


tyr 


gly 


gly 


arg 


leu 


361/121 
















391/131 


















gtc aca 


ttc 


gat 


gee 


gca 


ctg 


gee 


gat 


tea 


gca tec 


gca 


ggc 


etc 


gtc 


gag 


gtg 


ttg 


tag 


val thr 


phe 


asp 


ala 


ala 


leu 


ala 


asp 


ser 


ala ser 


ala 


gly 


leu 


val 


glu 


val 


leu 


AMB 



SEQ ID No. 15R 



FIGURE 15R 



SeqlSP: ORF according to Cole et al. (Nature 393:537-544) containing Rv2530c 
1/1 31/11 

tga tgt tec gee gga tgc gee gac ggt gac ttc cga gga tgt cgt ccg cgc get cga gga 
OPA cys ser ala gly cys ala asp gly asp phe arg gly cys arg pro arg ala arg gly 
61/21 91/31 

cga cgt gtg acg gca ctg etc gat gtc aat gtg ctg ate gcg ctg ggc tgg ccg aat cac 
arg arg val thr ala leu leu asp val asn val leu ile ala leu gly trp pro asn his 
121/41 151/51 

gtt cac cat gcg gee gcg cag cga tgg ttc acg cag ttc tec teg aat ggg tgg gec acc 
val his his ala ala ala gin arg trp phe thr gin phe ser ser asn gly trp ala thr 
181/61 211/71 

acg ccg ate acc gag gca ggg tat gtc cga att tea age aat cgc agt gtg atg cag gtg 
thr pro ile thr glu ala gly tyr val arg ile ser ser asn arg ser val met gin val 
241/81 271/91 

teg acc acg ccg get ate gcg ate get cag ttg gcg gcg atg act tct ctt gec ggg cac 
ser thr thr pro ala ile ala ile ala gin leu ala ala met thr ser leu ala gly his 
301/101 331/111 

acg ttt tgg cct gac gat gtg cca ctg ate gtt ggg age gee ggc gat cgc gat gcg gtg 
thr phe trp pro asp asp val pro leu ile val gly ser ala gly asp arg asp ala val 
361/121 391/131 

tec aac cac cgt egg gtc acc gac tgc cat etc ate gee ttg gee gcg cgc tac ggg ggc 
ser asn his arg arg val thr asp cys his leu ile ala leu ala ala arg tyr gly gly 
421/141 451/151 

egg ttg gtc aca ttc gat gee gca ctg gec gat tea gca tec gca ggc etc gtc gag gtg 
arg leu val thr phe asp ala ala leu ala asp ser ala ser ala gly leu val glu val 
481/161 
ttg tag 
leu AMB 

SEQ ID No.ISP 



FIGURE 15P 
REPLACEMENT SHEET (RULE 26) 



54/185 



Fragment containing SeqlSP 1 and Seq 15F 1 
1/1 31/11 

tga tgt tec gec gga tgc gec gac ggt gac ttc cga gga tgt cgt ccg cgc get cga gga 
OPA cys ser ala gly cys ala asp gly asp phe arg gly cys arg pro arg ala arg gly 
asp val pro pro asp ala pro thr val thr ser glu asp val val arg ala leu glu asp 
Met phe arg arg met arg arg arg OPA leu pro arg met ser ser ala arg ser arg thr 
61/21 91/31 

cga cgt gtg acg gca ctg etc gat gtc aat gtg ctg ate gcg ctg ggc tgg ccg aat cac 
arg arg val thr ala leu leu asp val asn val leu ile ala leu gly trp pro asn his 
asp val OPA arg his cys ser met ser met cys OPA ser arg trp ala gly arg ile thr 
thr cys asp gly thr ala arg cys gin cys ala asp arg ala gly leu ala glu ser arg 
121/41 151/51 

gtt cac cat gcg gee gcg cag cga tgg ttc acg cag ttc tec teg aat ggg tgg gec acc 
val his his ala ala ala gin arg trp phe thr gin phe ser ser asn gly trp ala thr 
phe thr met arg pro arg ser asp gly ser arg ser ser pro arg met gly gly pro pro 
ser pro cys gly arg ala ala met val his ala val leu leu glu trp val gly his his 
181/61 211/71 

acg ccg ate acc gag gca ggg tat gtc cga att tea age aat cgc agt gtg atg cag gtg 
thr pro ile thr glu ala gly tyr val arg ile ser ser asn arg ser val met gin val 
arg arg ser pro arg gin gly met ser glu phe gin ala ile ala val OPA cys arg cys 
ala asp his arg gly arg val cys pro asn phe lys gin ser gin cys asp ala gly val 
241/81 271/91 

teg acc acg ccg get ate gcg ate get cag ttg gcg gcg atg act tct ctt gee ggg cac 
ser thr thr pro ala ile ala ile ala gin leu ala ala met thr ser leu ala gly his 
arg pro arg arg leu ser arg ser leu ser trp arg arg OPA leu leu leu pro gly thr 
asp his ala gly tyr arg asp arg ser val gly gly asp asp phe ser cys arg ala his 
301/101 331/111 

acg ttt tgg cct gac gat gtg cca ctg ate gtt ggg age gec ggc gat cgc gat gcg gtg 
thr phe trp pro asp asp val pro leu ile val gly ser ala gly asp arg asp ala val 
arg phe gly leu thr met cys his OPA ser leu gly ala pro ala ile ala met arg cys 
val leu ala OPA arg cys ala thr asp arg trp glu arg arg arg ser arg cys gly val 
361/121 391/131 

tec aac cac cgt egg gtc acc gac tgc cat etc ate gec ttg gee gcg cgc tac ggg ggc 
ser asn his arg arg val thr asp cys his leu ile ala leu ala ala arg tyr gly gly 
pro thr thr val gly ser pro thr ala ile ser ser pro trp pro arg ala thr gly ala 
gin pro pro ser gly his arg leu pro ser his arg leu gly arg ala leu arg gly pro 
421/141 451/151 

egg ttg gtc aca ttc gat gec gca ctg gee gat tea gca tec gca ggc etc gtc gag gtg 
arg leu val thr phe asp ala ala leu ala asp ser ala ser ala gly leu val glu val 
gly trp ser his ser met pro his trp pro ile gin his pro gin ala ser ser arg cys 
val gly his ile arg cys arg thr gly arg phe ser ile arg arg pro arg arg gly val 
481/161 511/171 

ttg tag tea ccg ggg atg ggc ggc teg cca ggc ctg cag gat ctg egg gcg cag gcg ccc 
leu AMB ser pro gly met gly gly ser pro gly leu gin asp leu arg ala gin ala pro 
cys ser his arg gly trp ala ala arg gin ala cys arg ile cys gly arg arg arg pro 
val val thr gly asp gly arg leu ala arg pro ala gly ser ala gly ala gly ala pro 
541/181 571/191 

ccg gtc gga cac egg cag gec gac get ttt ggc cca cgc gcg cag etc ggc get get ggg 
pro val gly his arg gin ala asp ala phe gly pro arg ala gin leu gly ala ala gly 
arg ser asp thr gly arg pro thr leu leu ala his ala arg ser ser ala leu leu gly 
gly arg thr pro ala gly arg arg phe trp pro thr arg ala ala arg arg cys trp ala 
601/201 631/211 

etc ggg etc ggc ggc age egg etc gaa aac cgt ggt ggc gtc ggc ate gtc gac gaa cca 
leu gly leu gly gly ser arg leu glu asn arg gly gly val gly ile val asp glu pro 
ser gly ser ala ala ala gly ser lys thr val val ala ser ala ser ser thr asn gin 
arg ala arg arg gin pro ala arg lys pro trp trp arg arg his arg arg arg thr arg 



SEQ ID No.lSQ 
FIGURE 15Q 



REPLACEMENT SHEET (RULE 26) 



55/185 



661/221 691/231 

ggt gag ggc ggc ggc tag ata gcg gta ggt gta ttc ctg ggc gag ctt gcg ggt ttg gca 
gly glu gly gly gly AMB ile ala val gly val phe leu gly glu leu ala gly leu ala 
val arg ala ala ala arg AMB arg AMB val tyr ser trp ala ser leu arg val trp gin 
OPA gly arg arg leu asp ser gly arg cys ile pro gly arg ala cys gly phe gly arg 
721/241 - 751/251 

gaa cac gat egg cac gtt ggg aaa gec gat ctg caa ttc ggc cag ccc ate ggc gat cgc 
glu his asp arg his val gly lys ala asp leu gin phe gly gin pro ile gly asp arg 
asn thr ile gly thr leu gly lys pro ile cys asn ser ala ser pro ser ala ile ala 
thr arg ser ala arg trp glu ser arg ser ala ile arg pro ala his arg arg ser pro 
781/261 811/271 

cgt egg gcg ggc gaa gga gtg cgc gaa gat etc cga gta gcg gtc etc gac cac cac ggc 
arg arg ala gly glu gly val arg glu asp leu arg val ala val leu asp his his gly 
val gly arg ala lys glu cys ala lys ile ser glu AMB arg ser ser thr thr thr ala 
ser gly gly arg arg ser ala arg arg ser pro ser ser gly pro arg pro pro arg arg 
841/281 871/291 

ggc ccg tgg cag cgc ggc cag ttc ggt cag ttg gta ttt cag gtt gee gtt cag cac gec 
gly pro trp gin arg gly gin phe gly gin leu val phe gin val ala val gin his ala 
ala arg gly ser ala ala ser ser val ser trp tyr phe arg leu pro phe ser thr pro 
pro val ala ala arg pro val arg ser val gly ile ser gly cys arg ser ala arg gin 
901/301 931/311 

aga agt aag gtc cgc caa cgc ttt acg etc gac ggc cgc cac gag ttg gec ggc cac ttt 
arg ser lys val arg gin arg phe thr leu asp gly arg his glu leu ala gly his phe . 
glu val arg ser ala asn ala leu arg ser thr ala ala thr ser trp pro ala thr phe 
lys OCH gly pro pro thr leu tyr ala arg arg pro pro arg val gly arg pro leu ser 
961/321 991/331 

cag gec gta gtc gec gca ggg cag ggc ttc ccg cgt cgt ctt cgc ggg ttt gtc ggc aaa 
gin ala val val ala ala gly gin gly phe pro arg arg leu arg gly phe val gly lys 
arg pro AMB ser pro gin gly arg ala ser arg val val phe ala gly leu ser ala lys 
gly arg ser arg arg arg ala gly leu pro ala ser ser ser arg val cys arg gin arg 
1021/341 1051/351 

ggt gta ggg gta gcg ttc gtg ggc gtc gac gac gat gtg cag etc ggg gat gee ggc ggc 
gly val gly val ala phe val gly val asp asp asp val gin leu gly asp ala gly gly 
val AMB gly AMB arg ser trp ala ser thr thr met cys ser ser gly met pro ala ala 

cys arg gly ser val arg gly arg arg arg arg cys ala ala arg gly cys arg arg arg 
1081/361 1111/371 




gly arg trp gly cys ala arg pro ala ala thr val cys ala phe trp gly ser ala arg 
1141/381 1171/391 

gaa cac cac ctg gcg gee gcg cgc cat ggt gtg cac cag ttg cga teg gtt etc ccg cgc 
glu his his leu ala ala ala arg his gly val his gin leu arg ser val leu pro arg 
asn thr thr trp arg pro arg ala met val cys thr ser cys asp arg phe ser arg ala 
thr pro pro gly gly arg ala pro trp cys ala pro val ala ile gly ser pro ala arg 
1201/401 1231/411 

gcg ggc ggc gac gac gtc gat ggc cgc gee ccg gcg get gca get gcg tag etc gac ccg 
ala gly gly asp asp val asp gly arg ala pro ala ala ala ala ala AMB leu asp pro 
arg ala ala thr thr ser met ala ala pro arg arg leu gin leu arg ser ser thr arg 
gly arg arg arg arg arg trp pro arg pro gly gly cys ser cys val ala arg pro gly 
1261/421 1291/431 

gtc gac gac gac ggg gtc ggc ggg cca gtc ggc gat gtc gag gcg atg gca ata cag cgc 
val asp asp asp gly val gly gly pro val gly asp val glu ala met ala lie gin arg 
ser thr thr thr gly ser ala gly gin ser ala met ser arg arg trp gin tyr ser ala 
arg arg arg arg gly arg arg ala ser arg arg cys arg gly asp gly asn thr ala pro 



SEQ ID No.lSQ (continued 1) 
FIGURE 15Q (continued 1) 



REPLACEMENT SHEET (RULE 26) 




56/185 



1321/441 1351/451 

ctt ggt gcg egg cca cac gtc tga ggt ggc gaa gac cag tec cgc gec cac egg cag ccg 
leu gly ala arg pro his val OPA gly gly glu asp gin ser arg ala his arg gin pro 
leu val arg gly his thr ser glu val ala lys thr ser pro ala pro thr gly ser arg 
trp cys ala ala thr arg leu arg trp arg arg pro val pro arg pro pro ala ala gly 
1381/461 1411/471 

gat cag gta ggg cag gcg cga gtc ttc age ggg gtt ggc ggc gac gag cag etc cac aga 
asp gin val gly gin ala arg val phe ser gly val gly gly asp glu gin leu his arg 
ile arg AMB gly arg arg glu ser ser ala gly leu ala ala thr ser ser ser thr glu 
ser gly arg ala gly ala ser leu gin arg gly trp arg arg arg ala ala pro gin ser 
1441/481 1471/491 

gtg tga ggg tac ggg egg cgt acg gca acg gtg aag cag gca etc cga cga acc cat cgt 
val OPA gly tyr gly arg arg thr ala thr val lys gin ala leu arg arg thr his arg 
cys glu gly thr gly gly val arg gin arg OPA ser arg his ser asp glu pro ile val 
val arg val arg ala ala tyr gly asn gly glu ala gly thr pro thr asn pro ser ser 
1501/501 

cac gtc gaa ggg gca ggt ga 
his val glu gly ala gly 
thr ser lys gly gin val 
arg arg arg gly arg OPA 



SEQ ID No.lSQ (continued 2) 
FIGURE 15Q (continued (2) 



31/11 



TGC GCA 


TGC 


CGA 


CCA 


GTG 


TGG 


TTG 


GCC 


GGA 


GTT CGT 


TTG 


TTC 


GCG 


ATT 


GCC 


TCA 


ACG 


ATT 


cys ala 


cys 


arg 


pro 


val 


trp 


leu 


ala 


gly 


val arg 


leu 


phe 


ala 


ile 


ala 


ser 


thr 


ile 


61/21 


















91/31 














TCA 


TGC 


CGA TAT 


AAC 


CAC 


TCT 


AGT 


CAC 


ATC 


AAC 


CAC 


ACT CGT 


ACC 


ATC 


GAG 


CGT 


GTG 


GGT 


arg tyr 


asn 


his 


ser 


ser 


his 


ile 


asn 


his 


thr arg 


thr 


ile 


glu 


arg 


val 


gly 


ser 


cys 


121/41 


















151/51 


















CAT GCA 


TTC 


GCG 


ACC 


GCG 


GGA 


GCC 


GGC 


gaa' 


CCC GGC 


GCC 


ACA 


CAT 


AAT 


CCA 


GAT 


TGA 


GGA 


his ala 


phe 


ala 


thr 


ala 


gly 


ala 


gly 


glu 


pro gly 


ala 


thr 


his 


asn 


pro 


asp 


OPA 


gly 


181/61 
















211/71 
















CGC 


GAC TTC 


CGT 


GCC 


GAA 


CCG 


ACG 


CCG 


ACG 


CAA 


GCT TTC 


GAC 


AGC 


CAT 


GAG 


CGC 


GGT 


CGC 


asp phe 


arg 


ala 


glu 


pro 


thr 


pro 


thr 


gin 


ala phe 


asp 


ser 


his 


glu 


arg 


gly 


arg 


arg 


241/81 
















271/91 














GAC 


CGA 


CCT GGC 


AGT 


TGC 


AAG 


TCC 


TTG 


TGC 


ATA 


TTT 


TCT TGT 


CTA 


CGA 


ATC 


AAC 


CGA 


AAC 


pro gly 


ser 


cys 


lys 


ser 


leu 


cys 


ile 


phe 


ser cys 


leu 


arg 


ile 


asn 


arg 


asn 


asp 


arg 


301/101 


















331/111 














CGA 


GCT 


GCG GCC 


CGA 


GCA 


CCA 


TGA 


ATT 


CAA 


GCA 


GGC 


GGC GGT 


GTT 


GAC 


CGA 


CCT 


GCC 


CGG 


ala ala 


arg 


ala 


pro 


OPA 


ile 


gin 


ala 


gly 


gly gly 


val 


asp 


arg 


pro 


ala 


arg 


arg 


ala 


361/121 














391/131 


















GAT GTC 


CGC 


GCT 


ATC 


GCA 


GGG 


GTT 


GTC 


CCA 


GTT CGG 


GAT 


C 














asp val 


arg 


ala 


ile 


ala 


gly 


val 


val 


pro 


val arg 


asp 

















SEQ ID N0.I6A 
FIGURE 16A 



REPLACEMENT SHEET (RULE 26) 



57/185 



32/11 



CCCr CAT 


GCC 


GAC 


CAG 


TGT 


GGT 


TGG 


CCG 


GAG 


TTC GTT 


TGT 


TCG 


CGA 


TTG 


CCT 


CAA 


CGA 


TTC 


ulu Ills 


ala 


asp 


gin 


CVS 


crl v 


trp 


pro 


glu 


phe val 


cys 


ser 


arg 


leu 


pro 


gin 


arg 


phe 


69 / 9 1 


















92/31 


















HAT ATA 


ACC 


ACT 


CTA 


GTC 


ACA 


TCA 


ACC 


ACA 


CTC GTA 


CCA 


TCG 


AGC 


GTG 


TGG 


GTT 


CAT 


GCC 


^ ^T"> 1 1 P 


thr 


thr 


leu 


val 


thr 


ser 


thr 


thr 


leu val 


pro 


ser 


ser 


val 


trp 


val 


his 


ala 


199/4 1 


















152/51 


















ATG CAT 


TCG 


CGA 


CCG 


CGG 


GAG 


CCG 


GCG 


AAC 


CCG GCG 


CCA 


CAC 


ATA 


ATC 


CAG 


ATT 


GAG 


GAG 


met his 


ser 


arg 


pro 


arg 


glu 


pro 


ala 


asn 


pro ala 


pro 


his 


ile 


ile 


gin 


ile 


glu 


glu 


182/61 


















212/71 


















ACT TCC 


GTG 


CCG 


AAC 


CGA 


CGC 


CGA 


CGC 


AAG 


CTT TCG 


ACA 


GCC 


ATG 


AGC 


GCG 


GTC 


GCC 


GCC 


thr ser 


val 


pro 


asn 


arg 


arg 


arg 


arg 


lys 


leu ser 


thr 


ala 


met 


ser 


ala 


val 


ala 


ala 


242/81 
















272/91 


















CTG GCA 


GTT 


GGA 


AGT 


CCT 


TGT 


GCA 


TAT 


TTT 


CTT GTC 


TAC 


GAA 


TCA 


ACC 


GAA 


ACG 


ACC 


GAG 


leu ala 


val 


ala 


ser 


pro 


cys 


ala 


tyr 


phe 


leu val 


tyr 


glu 


ser 


thr 


glu 


thr 


thr 


glu 


302/101 


















332/111 


















CGG CCC 


GAG 


CAC 


CAT 


GAA 


TTC 


AAG 


CAG 


GCG 


GCG GTG 


TTG 


ACC 


GAC 


CTG 


CCC 


GGC 


GAG 


CTG 


arg pro 


glu 


his 


his 


glu 


phe 


lys 


gin 


ala 


ala val 


leu 


thr 


asp 


leu 


pro 


gly 


glu 


leu 


362/121 
















392/131 


















ATG TCC 


GCG 


CTA 


TCG 


CAG 


GGG 


TTG 


TCC 


CAG 


TCC GGG 


ATC 
















met ser 


ala 


leu 


ser 


gin 


gly 


leu 


ser 


gin 


phe gly 


ile 

















SEQ ID N0.I6B 
FIGURE 16B 



CGC ATG 


CCG 


ACC 


AGT 


GTG 


GTT 


arg met 


pro 


thr 


ser 


val 


val 


63/21 












ATA TAA' CCA 


CTC 


TAG 


TCA 


CAT 


ile OCH 


pro 


leu 


AMB 


ser 


his 


123/41 












TGC ATT 


CGC 


GAC 


CGC 


GGG 


AGC 


cys ile 


arg 


asp 


arg 


gly 


ser 


183/61 












CTT CCG 


TGC 


CGA 


ACC 


GAC 


GCC 


leu pro 


cys 


arg 


thr 


asp 


ala 


243/81 












TGG CAG 


TTG 


CAA 


GTC 


CTT 


GTG 


trp gin 


leu 


gin 


val 


leu 


val 


303/101 












GGC CCG 


AGC 


ACC 


ATG 


AAT 


TCA 


gly pro 


ser 


thr 


met 


asn 


ser 


363/121 












TGT CCG 


CGC 


TAT 


CGC 


AGG 


GGT 


cys pro 


arg 


tyr 


arg 


arg 


gly 









33/11 




GGC 


CGG 


AGT 


TCG TTT 


GTT 


gly 


arg 


ser 


ser phe 


val 








93/31 




CAA 


CCA 


CAC 


TCG TAC 


CAT 


gin 


pro 


his 


ser tyr 


his 








153/51 




CGG 


CGA 


ACC 


CGG CGC 


CAC 


arg 


arg 


thr 


arg arg 


his 








213/71 




GAC 


GCA 


AGC 


TTT CGA 


CAG 


asp 


ala 


ser 


phe arg 


gin 








273/91 




CAT 


ATT 


TTC 


TTG TCT 


ACG 


his 


ile 


phe 


leu ser 


thr 








333/111 




AGC 


AGG 


CGG 


CGG TGT 


TGA 


ser 


arg 


arg 


arg cys 


OPA 








393/131 




TGT 


CCC 


AGT 


TCG GGA 


TC 


cys 


pro 


ser 


ser gly 





SEQ ID N0.I6C 
FIGURE 16C 



CGC GAT TGC CTC AAC GAT TCG 
arg asp cys leu asn asp ser 

CGA GCG TGT GGG TTC ATG CCA 
arg ala cys gly phe met pro 

ACA TAA TCC AGA TTG AGG AGA 
thr OCH ser arg leu arg arg 

CCA TGA GCG CGG TCG CCG CCC 
pro OPA ala arg ser pro pro 

AAT CAA CCG AAA CGA CCG AGC 
asn gin pro lys arg pro ser 

CCG ACC TGC CCG GCG AGC TGA 
pro thr cys pro ala ser OPA 



REPLACEMENT SHEET (RULE 26) 



58/185 



GCG GGC CAC CGA TCA GTC GAT 
ala gly his arg ser val asp 
61/21 

GCA GTG ACG ACA CCA GCG CAG 
ala val thr thr pro ala gin 
121/41 

GGC TCG CCT TTT TTT CAT CAA 
gly ser pro phe phe his gin 
181/61 

TTC GGT CAC CTG ACG GTC GGG 
phe gly his leu thr val gly 
241/81 

TGC CCT GCT GGT GCG GCG TTC 
cys pro ala gly ala ala phe 
301/101 

GTC GAT GGC CCT CAA CTC GGC 
val asp gly pro gin leu gly 









31/11 




CGG 


GTG 


GTT 


TCC GCT 


CCA 


arcr 


val 


val 


ser ala 


pro 








91/31 




GAC 


GCG 


CCG 


TTG GTG 


TTT 


asp 


ala 


pro 


leu val 


phe 








151/51 




CGT 


TGG 


ACT 


GCC GCA 


GTG 


arg 


trp 


thr 


ala ala 


val 








211/71 




ATG 


TTC 


CTT 


GGG TCT 


CGG 


met 


phe 


leu 


gly ser 


arg 








271/91 




GGC 


CGA 


GTC 


GAT CAC 


CGC 


gly 


arg 


val 


asp his 


arg 








331/111 




ATC 


GCG 


ACT 


GGC GAT 


TAT 


ile 


ala 


thr 


gly asp 


tyr 



SEQ ID N0.17A 



TCA GCC CGG AAT TGA GGT GCC 
ser ala arg asn OPA gly ala 

CCC TCT GTT GCT TTC CCG TCC 
pro ser val ala phe pro ser 

GCG ATG TTG GTC GCC GGC GTG 
ala met leu val ala gly val 

GTT GCT GCT GGG TTT GCT CAA 
val ala ala gly phe ala gin 

CAA AGA GCA CCC GTT AAA ACG 
gin arg ala pro val lys thr 

CAC CAT GCC TCG GGC TGA TC 
his his ala ser gly OPA 



FIGURE 17A 



CGG GCC ACC GAT CAG 
arg ala thr asp gin 
62/21 

CAG TGA CGA CAC CAG 
gin OPA arg his gin 
122/41 

GCT CGC CTT TTT TTC 
ala arg leu phe phe 
182/61 

TCG GTC ACC TGA CGG 
ser val thr OPA arg 
242/81 

GCC CTG CTG GTG CGG 
ala leu leu val arg 
302/101 

TCG ATG GCC CTC AAC 
ser met ala leu asn 



TCG ATC GGG TGG TTT 
ser ile gly trp phe 

CGC AGG ACG CGC CGT 
arg arg thr arg arg 

ATC AAC GTT GGA CTG 
ile asn val gly leu 

TCG GGA TGT TCC TTG 
ser gly cys ser leu 

CGT TCG GCC GAG TCG 
arg ser ala glu ser 

TCG GCA TCG CGA CTG 
ser ala ser arg leu 



32/11 

CCG CTC CAT CAG CCC 
pro leu his gin pro 
92/31 

TGG TGT TTC CCT CTG 
trp cys phe pro leu 
152/51 

CCG CAG TGG CGA TGT 
pro gin trp arg cys 
212/71 

GGT CTC GGG TTG CTG 
gly leu gly leu leu 
272/91 

ATC ACC GCC AAA GAG 
ile thr ala lys glu 
332/111 

GCG ATT ATC ACC ATG 
ala ile ile thr met 



GGA ATT GAG GTG CCG 
gly ile glu val pro 

TTG CTT TCC CGT CCG 
leu leu ser arg pro 

TGG TCG CCG GCG TGT 
trp ser pro ala cys 

CTG GGT TTG CTC AAT 
leu gly leu leu asn 

CAC CCG TTA AAA CGG 
his pro leu lys arg 

CCT CGG GCT GAT C 
pro arg ala asp 



SEQ ID No.l7B 



FIGURE 17B 



REPLACEMENT SHEET (RULE 26) 



59/185 



GGG CCA 


CCG 


ATC 


AGT 


CGA 


rp /-« 


gly pro 


pro 


lie 


ser 


a rg 


ser 


63/21 












AGT GAC 


GAC 


ACC 


AGC 


GCA 




ser asp 


asp 




ser 


ala 


alv 

y _l j 


123/41 












CTC GCC 


TTT 


TTT 


TCA 


TCA 


ACG 


leu ala 


phe 


phe 


ser 


ser 


thr 


183/61 












CGG TCA 


CCT 


GAC 


GGT 


CGG 


GAT 


arg ser 


pro 


asp 


gly 


arg 


asp 


243/81 












CCC TGC 


TGG 


TGC 


GGC 


GTT 


CGG 


pro cys 


trp 


cys 


gly 


val 


arg 


303/101 












CGA TGG 


CCC 


TCA 


ACT 


CGG 


CAT 


arg trp 


pro 


ser 


thr 


arg 


his 









1 

O O / 11 




GGT 


/— /— rri 


1 11- 




ATC 


gly 


gly 


phe 


arg ser 


ile 












CGC 




\j 1 1 


1 Oil 


TCC 


arg 


ala 


val 


gly val 


ser 








153/51 




TTG 


GAC 


TGC 


CGC AGT 


GGC 


leu 


asp 


cys 


arg ser 


gly 








213/71 




GTT 


CCT 


TGG 


GTC TCG 


GGT 


val 


pro 


trp 


val ser 


gly 








273/91 




CCG 


AGT 


CGA 


TCA CCG 


CCA 


pro 


ser 


arg 


ser pro 


pro 






333/111 




CGC 


GAC 


TGG 


CGA TTA 


TCA 


arg 


asp 


trp 


arg leu 


ser 



SEQ ID No.l7C 
FIGURE 17C 



AGC CCG GAA TTG AGG TGC CGC 
ser pro glu leu arg cys arg 

CTC TGT TGC TTT CCC GTC CGG 
leu cys cys phe pro val arg 

GAT GTT GGT CGC. CGG CGT GTT 
asp val gly arg arg arg val 

TGC TGC TGG GTT TGC TCA ATG 
cys cys trp val cys ser met 

AAG AGC ACC CGT TAA AAC GGT 
lys ser thr arg OCH asn gly 

CCA TGC CTC GGG CTG ATC 
pro cys leu gly leu ile 



part of the nucleotide sequence of seq!7A 
1/1 31/11 

ggc tag aac ccc gaa gga gac etc gcg ggt tgc egg ccc ccg gec cat egg atg cgt ate 
aly AMB asn pro glu gly asp leu ala gly cys arg pro pro ala his arg met arg lie 

91/31 

egg teg cgc cga ttc acg acc gac ata ggg age tac ccc ttg ggt gat tec ggt gcg acg 
arg ser arg arg phe thr thr asp ile gly ser tyr pro leu gly asp ser gly ala thr 
121/41 151/51 

act gcg ata cgc teg gcg ggc cac cga tea gtc gat egg gtg gtt tec get cca tea gec 
thr ala ile arg ser ala gly his arg ser val asp arg val val ser ala pro ser ala 
181/61 211/71 

egg aat tga ggt gee gca gtg acg aca cca gcg cag gac gcg ccg ttg gtg ttt ccc tct 
arg asn OPA gly ala ala val thr thr pro ala gin asp ala pro leu val phe pro ser 
241/81 271/91 

qtt get ttc cgt ccg gtt cgc ctt ttt ttc ate aac- gtt gga ctg gec gca gtg gcg atg 
val ala phe arg pro val arg leu phe phe ile asn val gly leu ala ala val ala met 

301/101 331/111 

ttg gtc gec ggc gtg ttc ggt cac ctg acg gtc ggg atg ttc ttg ggt etc ggg ttg ctg 
leu val ala gly val phe gly his leu thr val gly met phe leu gly leu gly leu leu 
361/121 391/131 

ctg ggt ttg etc aat gee ctg ctg gtg egg cgt teg gec gag teg ate acc gec aaa gag 
leu gly leu leu asn ala leu leu val arg arg ser ala glu ser He thr ala lys glu 
421/141 451/151 

cac ccg tta aaa egg teg atg gec etc aac teg gca teg cga ctg gcg att ate acc ate 
his pro leu lys arg ser met ala leu asn ser ala ser arg leu ala xle lie thr lie 
481/161 

etc ggg ctg ate 
leu gly leu ile 

SEQ ID N0.17A' 
FIGURE 17A' 



RF-PI ACEMFNT SHEET (RULE 26) 



60/185 



l/l 

get aga acc ccg aag gag acc teg egg gtt 
ala arg thr pro lys glu thr ser arg val 
61/21 

ggt cgc gee gat tea cga ccg aca tag gga 
gly arg ala asp ser arg pro thr AMB gly 
121/41 

ctg cga tac get egg egg gee acc gat cag 
leu arg tyr ala arg arg ala thr asp gin 
181/61 

gga att gag gtg ccg cag tga cga cac cag 
gly ile glu val pro gin OPA arg his gin 
241/81 

ttg ctt tec gtc egg ttc gec ttt ttt tea 
leu leu ser val arg phe ala phe phe ser 
301/101 

tgg teg ccg gcg tgt teg gtc acc tga egg 
trp ser pro ala cys ser val thr OPA arg 
361/121 

tgg gtt tgc tea atg ccc tgc tgg tgc ggc 
trp val cys ser met pro cys trp cys gly 
421/141 

acc cgt taa aac ggt cga tgg ccc tea act 
thr arg OCH asn gly arg trp pro ser thr 
481/161 

teg ggc tga tc 
ser gly OPA 



3.1/11 

gee ggc ccc egg ccc ate gga tgc gta tec 
ala gly pro arg pro ile gly cys val ser 
91/31 

get acc cct tgg gtg att ccg gtg cga cga 
ala thr pro trp val ile pro val arg arg 
151/51 

teg ate ggg tgg ttt ccg etc cat cag ccc 
ser ile gly trp phe pro leu his gin pro 
211/71 

cgc agg acg cgc cgt tgg tgt ttc cct ctg 
arg arg thr arg arg trp cys phe pro leu 
271/91 

tea acg ttg gac tgg ccg cag tgg cga tgt 
ser thr leu asp trp pro gin trp arg cys 
331/111 

teg gga tgt tct tgg gtc teg ggt tgc tgc 
ser gly cys ser trp val ser gly cys cys 
391/131 

gtt egg ccg agt cga tea ccg cca aag age 
val arg pro ser arg ser pro pro lys ser 
451/151 

egg cat cgc gac tgg cga tta tea cca tec 
arg his arg asp trp arg leu ser pro ser 



SEQ ID No.l7B f 
FIGURE 17B f 



l/l 

eta gaa ccc cga agg aga cct cgc ggg ttg 
leu glu pro arg arg arg pro arg gly leu 
61/21 

gtc gcg ccg att cac gac cga cat agg gag 
val ala pro ile his asp arg his arg glu 
121/41 

tgc gat acg etc ggc ggg cca ccg ate agt 
cys asp thr leu gly gly pro pro ile ser 
181/61 

gaa ttg agg tgc cgc agt gac gac acc age 
glu leu arg cys arg ser asp asp thr ser 
241/81 

tgc ttt ccg tec ggt teg cct ttt ttt cat 
cys phe pro ser gly ser pro phe phe his 
301/101 

ggt cgc egg cgt gtt egg tea cct gac ggt 
gly arg arg arg val arg ser pro asp gly 
361/121 

ggg ttt get caa tgc cct get ggt gcg gcg 
gly phe ala gin cys pro ala gly ala ala 
421/141 

ccc gtt aaa acg gtc gat ggc cct caa etc 

pro val lys thr val asp gly pro gin leu 

481/161 

egg get gat c 

arg ala asp 



31/11 

ccg gee ccc ggc cca teg gat gcg tat ccg 
pro ala pro gly pro ser asp ala tyr pro 
91/31 

eta ccc ctt ggg tga ttc egg tgc gac gac 
leu pro leu gly OPA phe arg cys asp asp 
151/51 

cga teg ggt ggt ttc cgc tec ate age ccg 
arg ser gly gly phe arg ser ile ser pro 
211/71 

gca gga cgc gee gtt ggt gtt tec etc tgt 
ala gly arg ala val gly val ser leu cys 
271/91 

caa cgt tgg act ggc cgc agt ggc gat gtt 
gin arg trp thr gly arg ser gly asp val 
331/111 

egg gat gtt ctt ggg tct egg gtt get get 
arg asp val leu gly ser arg val ala ala 
391/131 

ttc ggc cga gtc gat cac cgc caa aga gca 
phe gly arg val asp his arg gin arg ala 
451/151 

ggc ate gcg act ggc gat tat cac cat cct 
gly ile ala thr gly asp tyr his his pro 



SEQ ID No-17C f 



FIGURE 17C 
REPLACEMENT SHEET (RULE 26) 



61/185 

sequence Rvl303 predicted by Cole et al. (Nature 393:537-544) and partially containing 
Seql7A" 

1/1 31/11 

atq acq aca cca gcg cag gac gcg ccg ttg gtg ttt ccc tct gtt get ttc cgt ccg gtt 
met thr thr pro ala gin asp ala pro leu val phe pro ser val ala phe arg pro val 
61/21 91/31 

cac ctt ttt ttc ate aac gtt gga ctg gec gca gtg gcg atg ttg gtc gec ggc gtg ttc 
arc ieu phe phe ile asn val gly leu ala ala val ala met leu val ala gly val phe 
121/41 ' 151/51 

qgt cac ctg acg gtc ggg atg ttc ttg ggt etc ggg ttg ctg ctg ggt ttg etc aat gec 
alv his leu thr val gly met phe leu gly leu gly leu leu leu gly leu leu asn ala 
181/61 211/71 

ctg ctg gtg egg cgt teg gec gag teg ate acc gec aaa gag cac ccg tta aaa egg teg 
leu leu val arg arg ser ala glu ser ile thr ala lys glu his pro leu lys arg ser 
241/81 271/91 

atg gee etc aac teg gca teg cga ctg gcg att ate ace ate etc ggg ctg ate ate gec 
met ala leu asn ser ala ser arg leu ala ile ile thr ile leu gly leu lie lie ala 
301/101 331/111 

tac att ttc egg ccc get gga ttg ggc gtc gtg ttc ggg ctg gca ttc ttc cag gtg ctg 
tvr ile phe arg pro ala gly leu gly val val phe gly leu ala phe phe gin val leu 
361/121 391/131 

ctg gtg gca acg acg gee ctg ccg gtc ctg aag aag ctg cgc act gcg ace gag gaa ccg 
leu val ala thr thr ala leu pro val leu lys lys leu arg thr ala thr glu glu pro 
421/141 451/151 

ate gca act tat tct tec aat ggc cag ace ggg gga teg gaa gga agg age gee age gat 
val ala thr tyr ser ser asn gly gin thr gly gly ser glu gly arg ser ala ser asp 



481/161 
gac tga 
asp OPA 



SEQ ID No.l7D 
FIGURE 17D 

Orf according to Cole et al. (Nature 393:537-544) and containing Rvl303 
1/1 31/11 

tqa qgt gee gca gtg acg aca cca gcg cag gac gcg ccg ttg gtg ttt ccc tct gtt get 
OPA gly ala ala val thr thr pro ala gin asp ala pro leu val phe pro ser val ala 
61/21 91/31 

ttc cgt ccg gtt cgc ctt ttt ttc ate aac gtt gga ctg gee gca gtg gcg atg ttg gtc 
phe arg pro val arg leu phe phe ile asn val gly leu ala ala val ala met leu val 

151/51 

gcc'ggc gtg ttc ggt cac ctg acg gtc ggg atg ttc ttg ggt etc ggg ttg ctg ctg ggt 
ala gly val phe gly his leu thr val gly met phe leu gly leu gly leu leu leu gly 
181/61 211/71 

ttg etc aat gee ctg ctg gtg egg cgt teg gec gag teg ate ace gee aaa gag cac ccg 
leu leu asn ala leu leu val arg arg ser ala glu ser ile thr ala lys glu his pro 
241/81 271/91 

tta aaa egg teg atg gec etc aac teg gca teg cga ctg gcg att ate ace ate etc ggg 
leu lys arg ser met ala leu asn ser ala ser arg leu ala ile ile thr lie leu gly 
301/101 331/111 

ctg ate ate gec tac att ttc egg ccc get gga ttg ggc gtc gtg ttc ggg ctg gca ttc 
leu ile ile ala tyr ile phe arg pro ala gly leu gly val val phe gly leu ala phe 
361/121 391/131 

ttc cag gtg ctg ctg gtg gca acg acg gee ctg ccg gtc ctg aag aag ctg cgc act gcg 
phe gin val leu leu val ala thr thr ala leu pro val leu lys lys leu arg thr ala 
421/141 451/151 

acc aaa gaa ccg gtc gca act tat tct tec aat ggc cag acc ggg gga teg gaa gga agg 
thr Slu glu pro val ala thr tyr ser ser asn gly gin thr gly gly ser glu gly arg 
481/161 

age gee age gat gac tga 
ser ala ser asp asp OPA 

SEQ ID N0.17F 
FIGURE 17F 



REPLACEMENT SHEET (RULE 26) 



• 



62/185 



GTC GAA CAG GTA CGG AAG GCG 
val glu gin val arg lys ala 
61/21 

AGC CAG CGG CCG TTA ACG TGG 
ser gin arg pro leu thr trp 
121/41 

TGG CTC AGG TCG ATA CCC GAG 
trp leu arg ser ile pro glu 
181/61 

TCG GGT GCA ACG ATC GGG CCA 
ser gly ala thr ile gly pro 
241/81 

ATG CGG AAG ACG ACT CGC GGC 
met arg lys thr thr arg gly 
301/101 

CAG CCG GTC CCG ATG TTC TCG 
gin pro val pro met phe ser 
361/121 

GTC GCC GCG GTC GTG CTG GGT 
val ala ala val val leu gly 



31/11 

CCG TCG GTC GCT CGG TCC 
pro ser val ala arg ser 
91/31 

CCG AAC AGG TCG TCT TGG 
pro asn arg ser ser trp 
151/51 

GGG ATG GCA AGT GTC ACC 
gly met ala ser val thr 

211/71 
TGC CTG ACG GGG AGC AGA 
cys leu thr gly ser arg 

271/91 
CCG ACG CCG CGG AGG CCG 
pro thr pro arg arg pro 

331/111 
ACC TAC GGT ATC GCC TCG 
thr tyr gly ile ala ser 

GCG ATG ATC 
ala met ile 



SEQ ID N0.I8A 
FIGURE 18A 



GCT GGT ATC TCG TGT TCA GCC 
ala gly ile ser cys ser ala 

GGT CGG GCA TCA GCG TCG ATG 
gly arg ala ser ala ser met 

CCG CCA TCC TTC CAC CTC TTT 
pro pro ser phe his leu phe 

GCC AGC CAC CGG CCC AAG AAG 
ala ser his arg pro lys lys 

CCG CGG CCG AAC CCA AAT CAT 
pro arg pro asn pro asn his 

ACA CTA CTC GGC GTG CTA TCG 
thr leu leu gly val leu ser 



TCG AAC AGG TAC GGA AGG CGC CGT CGG TCG CT^GGT CCG CTG GTA TCT CGT GTT CAG CCA 
ser asn arg tyr gly arg arg arg arg ser leu gly pro leu val ser arg val gin pro 

fi? / ? 1 92 / 31 

GCC AGC GGC CGT TAA CGT GGC CGA ACA GGT CGT CTT GGG GTC GGG CAT CAG CGT CGA TGT 
ala ser gly arg OCH arg gly arg thr gly arg leu gly val gly his gin arg arg cys 

GGC ^ TCA GGT CGA TAC CCG AGG GGA TGG CAA GTG TCA CCC CGC CAT CCT TCC ACC TCT TTT 
gly ser gly arg tyr pro arg gly trp gin val ser pro arg his pro ser thr ser phe 

212/71 

CGG^GTG CAA CGA TCG GGC CAT GCC TGA CGG GGA GCA GAG CCA GCC ACC GGC CCA AGA AGA 
arg 111 gin arg ser gly his ala OPA arg gly ala glu pro ala thr gly pro arg arg 

TGC^GA AGA CGA CTC GCG GCC CGA CGC CGC GGa'gGC CGC CGC GGC CGA ACC CAA ATC ATC 
cvs gly arg arg leu ala ala arg arg arg gly gly arg arg gly arg thr gin ile He 

332/111 

lie III TCC CGA TGT TCT CGA CCT ACG GTA TCG CCT CGA CAC TAC TCG GCG TGC TAT CGG 
ser arg ser arg cys ser arg pro thr val ser pro arg his tyr ser ala cys tyr arg 
362/121 

TCG CCG CGG TCG TGC TGG GTG CGA TGA TC 
ser pro arg ser cys trp val arg OPA 



SEQ ID N0.I8B 
FIGURE 18B 



REPLACEMENT SHEET (RULE 26) 



63/185 



3/1 












CGA AC A 


KD\3 1 




GAA 


GGC 


GCC 


arg thr 


gly 


thr 


glu 


al v 


ala 


63/21 












CCA GCG 






AAC 


GTG 


GCC 


pro ala 


sis 

ci x a 


val 


asn 


val 


ala 


123/ 41 












GCT CAG 








CGA 


GGG 


a 1 a rrl n 
did y 


val 


asp 


thr 


arg 


gly 


183/61 












GGG TGC 


AAC 


GAT 


CGG 


GCC 


ATG 


gly cys 


asn 


asp 


arg 


ala 


met 


243/81 












GCG GAA 


GAC 


GAC 


TCG 


CGG 


CCC 


ala glu 


asp 


asp 


ser 


arg 


pro 


303/101 












GCC GGT 


CCC 


GAT 


GTT 


CTC 


GAC 


ala gly 


pro 


asp 


val 


leu 


asp 


363/121 












CGC CGC 


GGT 


CGT 


GCT 


GGG 


TGC 


arg arg 


gly 


arg 


ala 


gly 


cys 









33/11 




GTC 


GGT 


CGC 


TCG GTC 


CGC 


val 


gly 


arg 


ser val 


arg 








93/31 




GAA 


CAG 


GTC 


GTC TTG 


GGG 


glu 


gin 


val 


val leu 


gly 












GAT 


GGC 


AAG 


TGT CAC 


CCC 


asp 


gly 


lys 


cys his 


pro 








213/71 




CCT 


GAC 


GGG 


GAG CAG 


AGC 


pro 


asp 


gly 


glu gin 


ser 






273/91 




GAC 


GCC 


GCG 


GAG GCC 


GCC 


asp 


ala 


ala 


glu ala 


ala 






333/111 




CTA 


CGG 


TAT 


CGC CTC 


GAC 


leu 


arg 


tyr 


arg leu 


asp 


GAT 


GAT 


C 






asp 


asp 









TGG TAT CTC GTG TTC AGC CAG 
trp tyr leu val phe ser gin 

TCG GGC ATC AGC GTC GAT GTG 
ser gly ile ser val asp val 

GCC ATC CTT CCA CCT CTT TTC 
ala ile leu pro pro leu phe 

CAG CCA CCG GCC CAA GAA GAT 
gin pro pro ala gin glu asp 

GCG GCC GAA CCC AAA TCA TCA 
ala ala glu pro lys ser ser 

ACT ACT CGG CGT GCT ATC GGT 
thr thr arg arg ala ile gly 



SEQ ID N0.I8C 



FIGURE 18C 



part of the nucl eotide sequence of seq!8A 

tTT * 31/n 



61/21 
AAC GTC 
asn va3 
121/41 



181/61 

CGG GCC ATG CCT 
arg ala met pro 
241/81 

TCG CGG CCC GAC 
ser arg pro asp 
301/101 

TTC TCG ACC TAC 
phe ser thr tyr 
361/121 

CTG GGT GCG ATG ATC 
leu gly ala met ile 



GGT 


CGC 


TCG 


GTC 


CGC 


TGG 


TAT CTC 


GTG 


TTC 


AGC 


CAG 


CCA 


GCG 


GCC 


GTT 


gly 


arg 


ser 


val 


arg 


trp 


tyr leu 


val 


phe 


ser 


gin 


pro 


ala 


ala 


val 












91/31 


















CAG 


GTC 


GTC 


TTG 


GGG 


TCG 


GGC ATC 


AGC 


GTC 


GAT 


GTG 


GCT 


CAG 


GTC 


gat' 


gin 


val 


val 


leu 


gly 


ser 


gly ile 


ser 


val 


asp 


val 


ala 


gin 


val 


asp 












151/51 


















GGC 


AAG 


TGT 


CAC 


CCC 


GCC 


ATC CTT 


CCA 


CCT 


CTT 


TTC 


GGG 


TGC 


AAC 


GAT 


gly 


lys 


cys 


his 


pro 


ala 


ile leu 


pro 


pro 


leu 


phe 


gly 


cys 


asn 


asp 










211/71 


















GAC 


GGG 


GAG 


CAG 


AGC 


CAG 


CCA CCG 


GCC 


CAA 


GAA 


GAT 


GCG 


GAA 


GAC 


GAC 


asp 


gly 


glu 


gin 


ser 


gin 


pro pro 


ala 


gin 


glu 


asp 


ala 


glu 


asp 


asp 










271/91 


















GCC 


GCG 


GAG 


GCC 


GCC 


GCG 


GCC GAA 


CCC 


AAA 


TCA 


TCA 


GCC 


GGT 


CCG 


ATG 


ala 


ala 


glu 


ala 


ala 


ala 


ala glu 


pro 


lys 


ser 


ser 


ala 


gly 


pro 


met 












331/111 
















GTG 


GGT 


ATC 


GCC 


TCG 


ACA 


CTA 


CTC GGC 


GTG 


CTA 


TCG 


GTC 


GCC 


GCG 


GTC 


gly 


ile 


ala 


ser 


thr 


leu 


leu gly 


val 


leu 


ser 


val 


ala 


ala 


val 


val 



SEQ ID No. 18A' 



FIGURE 18A f 



REPLACEMENT SHEET (RULE 26) 



64/185 



1/1 

CGG AAG GCG CCG 
arg lys ala pro 
61/21 

TTA ACG TGG CCG 
leu thr trp pro 
121/41 

ATA CCC GAG GGG 
ile pro glu gly 
181/61 

ATC GGG CCA TGC 
ile gly pro cys 
241/81 

ACT CGC GGC CCG 
thr arg gly pro 
301/101 

TGT TCT CGA CCT 
cys ser arg pro 
361/121 

TGC TGG GTG CGA 
cys trp val arg 



TCG GTC GCT CGG 
ser val ala arg 

AAC AGG TCG TCT 
asn arg ser ser 

ATG GCA AGT GTC 
met ala ser val 

CTG ACG GGG AGC 
leu thr gly ser 

ACG CCG CGG AGG 
thr pro arg arg 

ACG GTA TCG CCT 
thr val ser pro 

TGA TC 
OPA 



31/11 
TCC GCT GGT ATC 
ser ala gly ile 

91/31 
TGG GGT CGG GCA 
trp gly arg ala 

151/51 
ACC CCG CCA TCC 
thr pro pro ser 

211/71 
AGA GCC AGC CAC 
arg ala ser his 

271/91 
CCG CCG CGG CCG 
pro pro arg pro 

331/111 
CGA CAC TAC TCG 
arg his tyr ser 



TCG TGT TCA GCC 
ser cys ser ala 

TCA GCG TCG ATG 
ser ala ser met 

TTC CAC CTC TTT 
phe his leu phe 

CGG CCC AAG AAG 
arg pro lys lys 

AAC CCA AAT CAT 
asn pro asn his 

GCG TGC TAT CGG 
ala cys tyr arg 



AGC CAG CGG CCG 
ser gin arg pro 

TGG CTC AGG TCG 
trp leu arg ser 

TCG GGT GCA ACG 
ser gly ala thr 

ATG CGG AAG ACG 
met arg lys thr 

CAG CCG GTC CGA 
gin pro val arg 

TCG CCG CGG TCG 
ser pro arg ser 



SEQ ID No.l8B f 



FIGURE 18B f 



1/1 

GGA AGG CGC CGT CGG TCG CTC GGT CCG CTG 
gly arg arg arg arg ser leu gly pro leu 
61/21 

TAA CGT GGC CGA ACA GGT CGT CTT GGG GTC 
OCH arg gly arg thr gly arg leu gly val 
121/41 

TAC CCG AGG GGA TGG CAA GTG TCA CCC CGC 
tyr pro arg gly trp gin val ser pro arg 
181/61 

TCG GGC CAT GCC TGA CGG GGA GCA GAG CCA 
ser gly his ala OPA arg gly ala glu pro 
241/81 

CTC GCG GCC CGA CGC CGC GGA GGC CGC CGC 
leu ala ala arg arg arg gly gly arg arg 
301/101 

GTT CTC GAC CTA CGG TAT CGC CTC GAC ACT 
val leu asp leu arg tyr arg leu asp thr 
361/121 

GCT GGG TGC GAT GAT C 
ala gly cys asp asp 



31/11 


















GTA TCT 


CGT 


GTT 


CAG 


CCA 


GCC 


AGC 


GGC 


CGT 


val ser 


arg 


val 


gin 


pro 


ala 


ser 


gly 


arg 


91/31 


















GGG CAT 


CAG 


CGT 


CGA 


TGT 


GGC 


TCA 


GGT 


CGA 


gly his 


gin 


arg 


arg 


cys 


gly 


ser 


gly 


arg 


151/51 


















CAT CCT 


TCC 


ACC 


TCT 


TTT 


CGG 


GTG 


CAA 


CGA 


his pro 


ser 


thr 


ser 


phe 


arg 


val 


gin 


arg 


211/71 


















GCC ACC 


GGC 


CCA 


AGA 


AGA 


TGC 


GGA 


AGA 


CGA 


ala thr 


gly 


pro 


arg 


arg 


cys 


gly 


arg 


arg 


271/91 


















GGC CGA 


ACC 


CAA 


ATC 


ATC 


AGC 


CGG 


TCC 


GAT 


gly arg 


thr 


gin 


ile 


ile 


ser 


arg 


ser 


asp 


331/111 


















ACT CGG 


CGT 


GCT 


ATC 


GGT 


CGC 


CGC 


GGT 


CGT 


thr arg 


arg 


ala 


ile 


gly 


arg 


arg 


gly 


arg 



SEQ ID N0.I8C 



FIGURE 18C f 



REPLACEMENT SHEET (RULE 26) 



65/185 



sequence 


: Rv0199 


predicted by Cole 


et 


al. (Nature 


393 


: 537- 


-544) and containing 


seql8A* 




































1/1 


















31/11 


















atg cct 


gae 


999 


gag 


cag 


age 


cag 


cca 


ccg 


gcc eaa 


gaa 


gat 


gcg 


gaa 


gae 


aac 


tea 


eaa 


Met piro 


asp 


y xy 


glu 


gin 


ser 


gin 


pro 


pro 


ala gin 


alu 


asp 


ala 


glu 


asp 


asp 


ser 


arg 


61/21 


















91/31 


















ccc gae 


gee 


gcg 


era rr 
y ciy 


gee 




a en 


gcc 


gaa 


ccc aaa 


tea 


tea 


gcc 


ggt 


ccg 


atq 


ttc 


tea 

3 


pro asp 


ala 


ala 


glu 


ala 


ala 


ala 


ala 


glu 


pro lys 


ser 


ser 


ala 


gly pro 


met 


phe 


ser 


121/41 


















151/51 


















acc tac 


ggt 


ate 


gcc 


teg 


aca 


eta 


etc 


ggc 


ata eta 


tea 


gtc 


gcc 


gcg 


gtc 


gtg 


ctg 


ggt 


thr tyr 


y j-y 


ile 


ala 


ser 


thr 


leu 


leu 


gly 


val leu 


ser 


val 


ala 


ala 


val 


val 


leu 


gly 


181/61 
















211/71 


















gcg atg 


ate 


i_y y 


tec 


gea 


cac 


cac 


aat 


gae 


tec qqc 

3 3 


aaa 

3 3 


cgt 


acc 


tac 


ctg 


acc 


egg 


gtc 


ala met 


ile 




ser 


ala 


his 


ara 


asp 


asp 


ser gly 


glu 


arg 


thr 


tyr 


leu 


thr 


arg 


val 


241/81 


















271/91 


















atg ctg 


ace 


gcc 


get 


gaa 


taa 


acg 


acc 


gtg 


ctg ate 


aac 


atg 


aac 


gcc 


gae 


aac 


ate 


gat 


met leu 


thr 


ala 


ala 


alu 
^ j. « 


t rp 


thr 


ala 


val 


leu ile 


asn 


met 


asn 


ala 


asp 


asn 


ile 


asp 


301/101 


















331/111 


















gcc age 


ctg 


cag 


cga 


ctg 


cac 


aac 


aaa 


acg 


ate aat 


eaa 


etc 


aac 


acc 


gae 


ttc 


gae 


get 


ala ser 


leu 


gin 


arg 


leu 


his 


asp 


qly 


thr 


val gly 


gin 


leu 


asn 


thr 


asp 


phe 


asp 


ala 


361/121 


















391/131 


















ate atg 


cag 


ccc 


tac 


egg 


cag 


gtg 


gtg 


gag 


aag ttg 


egg 


acg 


cac 


age 


age 


ggc 


agg 


ate 


val val 


gin 


pro 


tyr 


arg 


gin 


val 


val 


glu 


lys leu 


arg 


thr 


his 


ser 


ser 


gly 


arg 


ile 


421/141 














451/151 


















gag gcg 


gta 


gcg 


ate 


gat 


acg 


gtg 


cac 


cgc 


gag ctg 


gat 


acc 


cag 


tec 


ggt 


gcc 


gcc 


cga 


glu ala 


val 


ala 


ile 


asp 


thr 


val 


his 


arg 


glu leu 


asp 


thr 


gin 


ser 


gly 


ala 


ala 


arg 


481/161 


















511/171 


















ccg gta 


gta 


acc 


acg 


aaa 


ttg 


cca 


ccg 


ttt 


gcc act 


cgc 


acc 


gae 


teg 


gtg 


ctg 


ctg 


gtc 


pro val 


val 


thr 


thr 


lys 


leu 


pro 


pro 


phe 


ala thr 


arg 


thr 


asp 


ser 


val 


leu 


leu 


val 


541/181 


















571/191 


















gcg acg 


teg 


gtc 


agt 


gag 


aac 


gcc 


ggc 


gcc 


aaa ccc 


cag 


acc 


gtg 


cac 


tgg 


aac 


ttg 


egg 


ala thr 


ser 


val 


ser 


glu 


asn 


ala 


gly 


ala 


lys pro 


gin 


thr 


val 


his 


trp 


asn 


leu 


arg 


601/201 


















631/211 


















etc gat 


gtc 


tec 


gat 


gtg 


gae 


ggc 


aag 


ctg 


atg ate 


tec 


egg 


ttg 


gag 


teg 


att 


cga 


tga 


leu asp 


val 


ser 


asp 


val 


asp 


gly 


lys 


leu 


met ile 


ser 


arg 


leu 


glu 


ser 


ile 


arg 


OPA 



SEQ ID N0.I8D 
FIGURE 18D 



REPLACEMENT SHEET (RULE 26) 



66/185 



ORF according to Cole et al . (Nature 393:537-544) and containing Rv0199 
1/1 31/11 

taa tec gat gec gga ttg ggt gaa atg cac caa gta acg ggt cga gtc ttt gga ate ggt 
OCH ser asp ala gly leu gly glu met his gin val thr gly arg val phe gly ile gly 
61/21 91/31 

ate gac ata gac tec gat gec gec gec cac gec ggc acg ttg cag agt gee aag ggc ggc 
ile asp ile asp ser asp ala ala ala his ala gly thr leu gin ser ala lys gly gly 
121/41 151/51 

ggc caa ttc ggt ggc gtc ggc cgc get gtc aat cgt ggc caa ttc gtc gtg cag egg ttg 
qly gin phe gly gly val gly arg ala val asn arg gly gin phe val val gin arg leu 
181/61 211/71 

cac ccc tgc gcg etc gac ggc ttc etc gtc gag gaa get ggc gta gag gtc gec gat gcg 
his pro cys ala leu asp gly phe leu val glu glu ala gly val glu val ala asp ala 
241/81 271/91 

ctg cgc ate ggt gec tac cgc age acc tgc ttg get ggc ctg gat gat cag gtc teg cac 
leu arg ile gly ala tyr arg ser thr cys leu ala gly leu asp asp gin val ser his 
301/101 331/111 

ttg tgt etc ggc gcg gtc gaa cag get acg gaa ggc gee gtc ggt cgc teg gtc cgc tgg 
leu cys leu gly ala val glu gin ala thr glu gly ala val gly arg ser val arg trp 
361/121 391/131 

tat etc gtg ttc age cag cca gcg gec gtt aac gtg gec gaa cag gtc gtc ttg ggg teg 
tyr leu val phe ser gin pro ala ala val asn val ala glu gin val val leu gly ser 
421/141 451/151 

ggc ate age gtc gat gtg get cag gtc gat acc cga ggg gat ggc aag tgt cac ccc gee 
gly ile ser val asp val ala gin val asp thr arg gly asp gly lys cys his pro ala 
481/161 511/171 

ate ctt cca cct ctt ttc ggg tgc aac gat egg gee atg cct gac ggg gag cag age cag 
ile leu pro pro leu phe gly cys asn asp arg ala met pro asp gly glu gin ser gin 
541/181 571/191 

cca ccg gee caa gaa gat gcg gaa gac gac teg egg ccc gac gec gcg gag gec gec gcg 
pro pro ala gin glu asp ala glu asp asp ser arg pro asp ala ala glu ala ala ala 
601/201 631/211 

gee gaa ccc aaa tea tea gee ggt ccg atg ttc teg acc tac ggt ate gec teg aca eta 
ala glu pro lys ser ser ala gly pro met phe ser thr tyr gly ile ala ser thr leu 
661/221 691/231 

etc ggc gtg eta teg gtc gee gcg gtc gtg ctg ggt gcg atg ate tgg tec gca cac cgc 
leu gly val leu ser val ala ala val val leu gly ala met ile trp ser ala his arg 
721/241 751/251 

gat gac tec ggc gag cgt acc tac ctg acc egg gtc atg ctg acc gec get gaa tgg acg 
asp asp ser gly glu arg thr tyr leu thr arg val met leu thr ala ala glu trp thr 
781/261 811/271 

gec gtg ctg ate aac atg aac gec gac aac ate gat gec age ctg cag cga ctg cac gac 
ala val leu ile asn met asn ala asp asn ile asp ala ser leu gin arg leu his asp 
841/281 871/291 

gga acg gtc ggt caa etc aac acc gac ttc gac get gtc gtg cag ccc tac egg cag gtg 
gly thr val gly gin leu asn thr asp phe asp ala val val gin pro tyr arg gin val 
901/301 931/311 

gtg gag aag ttg egg acg cac age age ggc agg ate gag gcg gta gcg ate gat acg gtg 
val glu lys leu arg thr his ser ser gly arg ile glu ala val ala ile asp thr val 
961/321 991/331 

cac cgc gag ctg gat acc cag tec ggt gec gec cga ccg gta gta acc acg aaa ttg cca 
his arg glu leu asp thr gin ser gly ala ala arg pro val val thr thr lys leu pro 
1021/341 1051/351 

ccg ttt gee act cgc acc gac teg gtg ctg ctg gtc gcg acg teg gtc agt gag aac gec 
pro phe ala thr arg thr asp ser val leu leu val ala thr ser val ser glu asn ala 
1081/361 1111/371 

ggc gee aaa ccc cag acc gtg cac tgg aac ttg egg etc gat gtc tec gat gtg gac ggc 
alv ala lys pro gin thr val his trp asn leu arg leu asp val ser asp val asp gly 
1141/381 1171/391 
aag ctg atg ate tec egg ttg gag teg att cga tga 
lys leu met ile ser arg leu glu ser ile arg OPA 

SEQ ID N0.I8F 



FIGURE 18F 



REPLACEMENT SHEET (RULE 2€) 



67/185 





















31/11 


















GTT GCG 


CAA 


CGG 


GGT 


GAG 


CAC 


CGA 


CGC 


GAT 


GAT GGC 


GCA 


ACT 


ATC 


GAA 


ACT 


GCA 


GGA 


CAT 


V d J- CI JL d 


gin 


arg 


alv 
\^ j. j 


crlu 

\j J. V* 


his 


arg 


arg 


asp 


asp gly 


ala 


thr 


ile 


glu 


thr 


ala 


gly 


his 


61/21 


















91/31 


















CGC CAA 


CGC 


CAA 


CGA 


CGG 


CAC 


TCG 


CGC 


GGT 


GGG CAC 


CCC 


TGG 


CTA 


TCA 


GGC 


CAG 


CGT 


CGA 


arn nln 


ara 


qln 


arg 


arg 


his 


ser 


arg 


gly 


gly his 


pro 


trp 


leu 


ser 


gly 


gin 


arg 


arg 


121/41 


















151/51 


















CTA TGT 


GGT 


AAA 


CAC 


ACT 


GCG 


CAA 


CAG 


CGG 


TTT TGA 


TGT 


GCA 


AAC 


CCC 


GGA 


GTT 


CTC 


CGC 


leu cys 


gly 


lys 


his 


thr 


ala 


gin 


gin 


arg 


phe OPA 


cys 


ala 


asn 


pro 


gly 


val 


leu 


arg 


181/61 














211/71 


















TCG CGT 


GTT 


CAA 


GGC 


CGA 


AAA 


AGG 


GGT 


GGT 


GAC CCT 


CGG 


CGG 


CAA 


CAC 


CGT 


GGA 


GGC 


GAG 


ser arg 


val 


gin 


gly 


arg 


lys 


arg 


gly 


gly 


asp pro 


arg 


arg 


gin 


his 


arg 


gly 


gly 


glu 


241/81 


















271/91 


















GGC GCT 


CGA 


GTA 


CAG 


CCT 


CGG 


CAC 


ACC 


GCC 


GGA CGG 


GGT 


GAC 


GGG 


CCC 


GCT 


GGT 


GGC 


TGC 


gly ala 


arg 


val 


gin 


pro 


arg 


his 


thr 


ala 


gly arg 


gly 


asp 


gly pro 


ala 


gly 


gly 


cys 


301/101 


















331/111 


















CGC CGC 


CGA 


CGA 


CAG 


TCC 


GGG 


CTG 


CAG 


TCC 


GTC GGA 


CTA 


CGA 


CAG 


GCT 


GCC 


GGT 


GTC 


CGG 


pro arg 


arg 


arg 


gin 


ser 


gly 


leu 


gin 


ser 


val gly 


leu 


arg 


gin 


ala 


ala 


gly 


val 


arg 



361/121 

TGC GGT GGT GCT GGT AGA TC 
cys gly gly ala gly arg 

SEQ ID No.l9A 
FIGURE 19A 





















32/11 


















TTG CGC 


AAC 


GGG 


GTG 


AGC 


ACC 


GAC 


GCG 


ATG 


ATG GCG 


CAA 


CTA 


TCG 


AAA 


CTG 


CAG 


GAC 


ATC 


leu arg 


asn 


gly 


val 


ser 


thr 


asp 


ala 


met 


met ala 


gin 


leu 


ser 


lys 


leu 


gin 


asp 


ile 


62/21 
















92/31 


















GCC AAC 


GCC 


AAC 


GAC 


GGC 


ACT 


CGC 


GCG 


GTG 


GGC ACC 


CCT 


GGC 


TAT 


CAG 


GCC 


AGC 


GTC 


GAC 


ala asn 


ala 


asn 


asp 


gly 


thr 


arg 


ala 


val 


gly thr 


pro 


gly 


tyr 


gin 


ala 


ser 


val 


asp 


122/41 


















152/51 


















TAT GTG 


GTA 


AAC 


ACA 


CTG 


CGC 


AAC 


AGC 


GGT 


TTT GAT 


GTG 


CAA 


ACC 


CCG 


GAG 


TTC 


TCC 


GCT 


tyr val 


val 


asn 


thr 


leu 


arg 


asn 


ser 


gly 


phe asp 


val 


gin 


thr 


pro 


glu 


phe 


ser 


ala 


182/61 


















212/71 


















CGC GTG 


TTC 


AAG 


GCC 


GAA 


AAA 


GGG 


GTG 


GTG 


ACC CTC 


GGC 


GGC 


AAC 


ACC 


GTG 


GAG 


GCG 


AGG 


arg val 


phe 


lys 


ala 


glu 


lys 


gly 


val 


val 


thr leu 


gly 


gly 


asn 


thr 


val 


glu 


ala 


arg 


242/81 














272/91 


















GCG CTC 


GAG 


TAC 


AGC 


CTC 


GGC 


ACA 


CCG 


CCG 


GAC GGG 


GTG 


ACG 


GGC 


CCG 


CTG 


GTG 


GCT 


GCC 


ala leu 


glu 


tyr 


ser 


leu 


gly 


thr 


pro 


pro 


asp gly 


val 


thr 


gly 


pro 


leu 


val 


ala 


ala 


302/101 














332/111 














TCC 


GGT 


CCC GCC 


GAC 


GAC 


AGT 


CCG 


GGC 


TGC 


AGT 


CCG 


TCG GAC 


TAC 


GAC 


AGG 


CTG 


CCG 


GTG 


pro ala 


asp 


asp 


ser 


pro 


gly 


cys 


ser 


pro 


ser asp 


tyr 


asp 


arg 


leu 


pro 


val 


ser 


gly 



362/121 

GCG GTG GTG CTG GTA GAT C 
ala val val leu val asp 



SEQ ID No.l9B 
FIGURE 19B 



REPLACEMENT SHEET (RULE 26) 



68/185 



TGC GCA ACG GGG TGA GCA 
cys ala thr gly OPA ala 
63/21 

CCA ACG CCA ACG ACG GCA 
pro thr pro thr thr ala 
123/41 

ATG TGG TAA ACA CAC TGC 
met trp OCH thr his cys 
183/61 

GCG TGT TCA AGG CCG AAA 
ala cys ser arg pro lys 
243/81 

CGC TCG AGT ACA GCC TCG 
arg ser ser thr ala ser 
303/101 

CCG CCG ACG ACA GTC CGG 
pro pro thr thr val arg 
363/121 

CGG TGG TGC TGG TAG ATC 
arg trp cys trp AMB ile 











33/11 




CCG 


ACG 


CGA 


TGA 


TGG CGC 


AAC 


pro 


thr 


arg 


OPA 


trp arg 


asn 










93/31 




CTC 


GCG 


CGG 


TGG 


GCA CCC 


CTG 


leu 


ala 


arg 


trp 


ala pro 


leu 










153/51 




GCA 


ACA 


GCG 


GTT 


TTG ATG 


TGC 


ala 


thr 


ala 


val 


leu met 


cys 










213/71 




AAG 


GGG 


TGG 


TGA 


CCC TCG 


GCG 


lys 


gly 


trp 


OPA 


pro ser 


ala 










273/91 




GCA 


CAC 


CGC 


CGG 


ACG GGG 


TGA 


ala 


his 


arg 


arg 


thr gly 


OPA 










333/111 




GCT 


GCA 


GTC 


CGT 


CGG ACT 


ACG 


ala 


ala 


val 


arg 


arg thr 


thr 



SEQ ID No. 19C 
FIGURE 19C 



TAT CGA AAC TGC AGG ACA TCG 
tyr arg asn cys arg thr ser 

GCT ATC AGG CCA GCG TCG ACT 
ala ile arg pro ala ser thr 

AAA CCC CGG AGT TCT CCG CTC 
lys pro arg ser ser pro leu 

GCA ACA CCG TGG AGG CGA GGG 
ala thr pro trp arg arg gly 

CGG GCC CGC TGG TGG CTG CCC 
arg ala arg trp trp leu pro 

ACA GGC TGC CGG TGT CCG GTG 
thr gly cys arg cys pro val 



part of 


the 


nucleotide sequence 


of seql9A 










1/1 


















31/11 










CTA TCG 


AAA 


CTG 


CAG 


GAC 


ATC 


GCC 


AAC 


GCC 


AAC GAC 


GGC 


ACT 


CGC 


GCG 


leu ser 


lys 


leu 


gin 


asp 


ile 


ala 


asn 


ala 


asn asp 


giy 


thr 


arg 


ala 


61/21 














91/31 










GGC TAT 


CAG 


GCC 


AGC 


GTC 


GAC 


TAT 


GTG 


GTA 


AAC ACA 


CTG 


CGC 


AAC 


AGC 


gly tyr 


gin 


ala 


ser 


val 


asp 


tyr 


val 


val 


asn thr 


leu 


arg 


asn 


ser 


121/41 
















151/51 










CAA ACC 


CCG 


GAG 


TTC 


TCC 


GCT 


CGC 


GTG 


TTC 


AAG GCC 


GAA 


AAA 


GGG 


GTG 


gin thr 


pro 


glu 


phe 


ser 


ala 


arg 


val 


phe 


lys ala 


glu 


lys 


gly 


val 


181/61 














211/71 










GGC AAC 


ACC 


GTG 


GAG 


GCG 


AGG 


GCG 


CTC 


GAG 


TAC AGC 


CTC 


GGC 


ACA 


CCG 


gly asn 


thr 


val 


glu 


ala 


arg 


ala 


leu 


glu 


tyr ser 


leu 


gly 


thr 


pro 


241/81 


















271/91 








AGT 


ACG GGC 


CCG 


CTG 


GTG 


GCT 


GCC 


CCC 


GCC 


GAC 


GAC AGT 


CCG 


GGC 


TGC 


thr gly 


pro 


leu 


val 


ala 


ala 


pro 


ala 


asp 


asp ser 


pro 


gly 


cys 


ser 


301/101 
















331/111 










GAC AGG 


CTG 


CCG 


GTG 


TCC 


GGT 


GCG 


GTG 


GTG 


CTG GTA 


GAT 


C 






asp arg 


leu 


pro 


val 


ser 


gly 


ala 


val 


val 


leu val 


asp 









SEQ ID No.l9A f 
FIGURE 19A 



REPLACEMENT SHEET (RULE 26) 



69/185 



1/1 


















31/11 


















TAT CGA 


AAC 


TGC 


AGG 


ACA 


TCG 


CCA 


ACG 


CCA 


ACG ACG 


GCA 


CTC 


GCG 


CGG 


TGG 


GCA 


CCC 


CTG 


tvr arg 


asn 


cys 


arg 


thr 


ser 


pro 


thr 


pro 


thr thr 


ala 


leu 


ala 


arg 


trp 


ala 


pro 


leu 


61/21 


















91/31 


















GCT ATC 


AGG 


CCA 


GCG 


TCG 


ACT 


ATG 


TGG 


TAA 


ACA CAC 


TGC 


GCA 


ACA 


GCG 


GTT 


TTG 


ATG 


TGC 


ala ile 


arg 


pro 


ala 


ser 


thr 


met 


trp 


OCH 


thr his 


cys 


ala 


thr 


ala 


val 


leu 


met 


cys 


121/41 














151/51 


















AAA CCC 


CGG 


AGT 


TCT 


CCG 


CTC 


GCG 


TGT 


TCA 


AGG CCG 


AAA 


AAG 


GGG 


TGG 


TGA 


CCC 


TCG 


GCG 


lys pro 


arg 


ser 


ser 


pro 


leu 


ala 


cys 


ser 


arg pro 


lys 


lys 


gly 


trp 


OPA 


pro 


ser 


ala 


181/61 
















211/71 


















GCA ACA 


CCG 


TGG 


AGG 


CGA 


GGG 


CGC 


TCG 


AGT 


ACA GCC 


TCG 


GCA 


CAC 


CGC 


CGG 


ACG 


GGG 


TGA 


ala thr 


pro 


trp 


arg 


arg 


gly 


arg 


ser 


ser 


thr ala 


ser 


ala 


his 


arg 


arg 


thr 


gly 


OPA 


241/81 
















271/91 


















CGG GCC 


CGC 


TGG 


TGG 


CTG 


CCC 


CCG 


CCG 


ACG 


ACA GTC 


CGG 


GCT 


GCA 


GTC 


CGT 


CGG 


ACT 


ACG 


arg ala 


arg 


trp 


trp 


leu 


pro 


pro 


pro 


thr 


thr val 


arg 


ala 


ala 


val 


arg 


arg 


thr 


thr 


301/101 


















331/111 


















ACA GGC 


TGC 


CGG 


TGT 


CCG 


GTG 


CGG 


TGG 


TGC 


TGG TAG 


ATC 
















thr gly 


cys 


arg 


cys 


pro 


val 


arg 


trp 


cys 


trp AMB 


ile 

















SEQ ID N0.19B' 
FIGURE 19B f 



1/1 


















31/11 


















ATC GAA 


ACT 


GCA 


GGA 


CAT 


CGC 


CAA 


CGC 


CAA 


CGA CGG 


CAC 


TCG 


CGC 


GGT 


GGG 


CAC 


CCC 


TGG 


ile glu 


thr 


ala 


gly 


his 


arg 


gin 


arg 


gin 


arg arg 


his 


ser 


arg 


gly 


gly 


his 


pro 


trp 


61/21 


















91/31 


















CTA TCA 


GGC 


CAG 


CGT 


CGA 


CTA 


TGT 


GGT 


AAA 


CAC ACT 


GCG 


CAA 


CAG 


CGG 


TTT 


TGA 


TGT 


GCA 


leu ser 


gly 


gin 


arg 


arg 


leu 


cys 


gly 


lys 


his thr 


ala 


gin 


gin 


arg 


phe 


OPA 


cys 


ala 


121/41 
















151/51 


















AAC CCC 


GGA 


GTT 


CTC 


CGC 


TCG 


CGT 


GTT 


CAA 


GGC CGA 


AAA 


AGG 


GGT 


GGT 


GAC 


CCT 


CGG 


CGG 


asn pro 


gly 


val 


leu 


arg 


ser 


arg 


val 


gin 


gly arg 


lys 


arg 


gly 


gly 


asp 


pro 


arg 


arg 


181/61 
















211/71 


















CAA CAC 


CGT 


GGA 


GGC 


GAG 


GGC 


GCT 


CGA 


GTA 


CAG CCT 


CGG 


CAC 


ACC 


GCC 


GGA 


CGG 


GGT 


GAC 


gin his 


arg 


gly 


gly 


glu 


gly 


ala 


arg 


val 


gin pro 


arg 


his 


thr 


ala 


gly 


arg 


gly 


asp 


241/81 


















271/91 


















GGG CCC 


GCT 


GGT 


GGC 


TGC 


CCC 


CGC 


CGA 


CGA 


CAG TCC 


GGG 


CTG 


CAG 


TCC 


GTC 


GGA 


CTA 


CGA 


gly pro 


ala 


gly 


gly 


cys 


pro 


arg 


arg 


arg 


gin ser 


gly 


leu 


gin 


ser 


val 


gly 


leu 


arg 


301/101 


















331/111 


















CAG GCT 


GCC 


GGT 


GTC 


CGG 


TGC 


GGT 


GGT 


GCT 


GGT AGA 


TC 
















gin ala 


ala 


gly 


val 


arg 


cys 


gly 


gly 


ala 


gly arg 
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sequence Rv0418 predicted by Cole et al. (Nature 393:537-544) and containing 
seql9A' 

1/1 31/11 

atg gtg aac aaa tec agg atg atg ccg gcg gtg ctg gcc gtg get gtg gtc gte gca ttc 
Met val asn lys ser arg met met pro ala val leu ala val ala val val val ala phe 
61/21 91/31 

ctg acg acg ggc tgt ate egg tgg tct acg cag teg egg ccc gtt gtt aac ggc ccc get 
leu thr thr gly cys ile arg trp ser thr gin ser arg pro val val asn gly pro ala 
121/41 151/51 

gcc gca gag ttc gcc gtt gcg ttg cgc aac egg gtg age acc gac gcg atg atg gcg cac 
ala ala glu phe ala val ala leu arg asn arg val ser thr asp ala met met ala his 
181/61 211/71 

eta teg aaa ctg cag gac ate gcc aac gcc aac gac ggc act cgc gcg gtg ggc acc cct 
leu ser lys leu gin asp ile ala asn ala asn asp gly thr arg ala val gly thr pro 
241/81 271/91 

ggc tat cag gcc age gtc gac tat gtg gta aac aca ctg cgc aac age ggt ttt gat gtg 
gly tyr gin ala ser val asp tyr val val asn thr leu arg asn ser gly phe asp val 
301/101 331/111 

caa acc ccg gag ttc tec get cgc gtg ttc aag gcc gaa aaa ggg gtg gtg acc etc ggc 
gin thr pro glu phe ser ala arg val phe lys ala glu lys gly val val thr leu gly 
361/121 391/131 

ggc aac acc gtg gag gcg agg gcg etc gag tac age etc ggc aca ccg ccg gac ggg gtg 
gly asn thr val glu ala arg ala leu glu tyr ser leu gly thr pro pro asp gly val 
421/141 451/151 

acg ggc ccg ctg gtg get gcc ccc gcc gac gac agt ccg ggc tgc agt ccg teg gac tac 
thr gly pro leu val ala ala pro ala asp asp ser pro gly cys ser pro ser asp tyr 
481/161 511/171 

gac agg ctg ccg gtg tec ggt gcg gtg gtg ctg gta gat cgc ggc gtc tgt cct ttt gcc 
asp arg leu pro val ser gly ala val val leu val asp arg gly val cys pro phe ala 
541/181 571/191 

cag aag gaa gac gca gcc gcg cag cgc ggt gcg gtg gcg ctg ate att get gac aac ate 
gin lys glu asp ala ala ala gin arg gly ala val ala leu ile ile ala asp asn ile 
601/201 631/211 

gac gag cag gcg atg ggc ggc acc ctg ggg get aat acc gac gtc aag ate ccg gtg gtg 
asp glu gin ala met gly gly thr leu gly ala asn thr asp val lys ile pro val val 
661/221 691/231 

agt gtc acc aag teg gtc gga ttc cag eta cgc gga cag tct ggg cca acc acc gtc aag 
ser val thr lys ser val gly phe gin leu arg gly gin ser gly pro thr thr val lys 
721/241 751/251 

etc acg gcg age acc caa agt ttc aag gcc cgc aac gtc ate gcg cag acg aag acg ggg 
leu thr ala ser thr gin ser phe lys ala arg asn val ile ala gin thr lys thr gly 
781/261 811/271 

teg teg gcc aac gtg gtg atg gca ggt gcg cat ttg gac age gtt ccg gaa gga ccc ggc 
ser ser ala asn val val met ala gly ala his leu asp ser val pro glu gly pro gly 
841/281 871/291 

ate aac gac aac ggc teg gga gtg get gcg gtt ctg gaa acg gca gtg cag ctg ggg aac 
ile asn asp asn gly ser gly val ala ala val leu glu thr ala val gin leu gly asn 
901/301 931/311 

tea ccg cat gtg tec aac gcg gta egg ttc gcc ttc tgg ggc gcc gag gaa ttc ggc ctg 
ser pro his val ser asn ala val arg phe ala phe trp gly ala glu glu phe gly leu 
961/321 991/331 

att ggg tea cga aac tac gtc gag teg ctg gac ate gac gcg etc aaa ggc ate gcg ccg 
ile gly ser arg asn tyr val glu ser leu asp ile asp ala leu lys gly ile ala leu 

SEQ ID No, 19 D 
FIGURE 19D 
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ORF according to Cole et al . (Nature 393:537-544) and containing Rv0418 
1/1 31/11 

tag gcc att caa cgc tct gtt cgt ttg att ggt egg tgg gat gcg aaa get gcg egg cga 
AMB ala ile gin arg ..ser val arg leu ile gly arg trp asp ala lys ala ala arg arg 
61/21 '•' ' 91/31 

cag gcg egg tct aat ctg ggc gcg atg gtg aac aaa tec agg atg atg ccg gcg gtg ctg 
gin ala arg ser asn leu gly ala met val asn lys ser arg met met pro ala val leu 
121/41 151/51 

gcc gtg get gtg gtc gtc gca ttc ctg acg acg ggc tgt ate egg tgg tct acg cag teg 
ala val ala val val val ala phe leu thr thr gly cys ile arg trp ser thr gin ser 
181/61 211/71 

cqg ccc gtt gtt aac ggc ccc get gcc gca gag ttc gcc gtt gcg ttg cgc aac egg gtg 
arg pro val val asn gly pro ala ala ala glu phe ala val ala leu arg asn arg val 
241/81 271/91 

age acc gac gcg atg atg gcg cac eta teg aaa ctg cag gac ate gcc aac gcc aac gac 
ser thr asp ala met met ala his leu ser lys leu gin asp ile ala asn ala asn asp 
301/101 331/111 

ggc act cgc gcg gtg ggc acc cct ggc tat cag gcc age gtc gac tat gtg gta aac aca 
qlv thr arg ala val gly thr pro gly tyr gin ala ser val asp tyr val val asn thr 
361/121 391/131 

ctg cgc aac age ggt ttt gat gtg caa acc ccg gag ttc tec get cgc gtg ttc aag gcc 
leu arg asn ser gly phe asp val gin thr pro glu phe ser ala arg val phe lys ala 
421/141 451/151 

gaa aaa ggg gtg gtg acc etc ggc ggc aac acc gtg gag gcg agg gcg etc gag tac age 
qlu lys gly val val thr leu gly gly asn thr val glu ala arg ala leu glu tyr ser 
481/161 511/171 



etc ggc aca ccg ccg gac ggg gtg acg ggc ccg ctg gtg get gcc ccc gcc gac gac agt 
leu qly thr pro pro asp gly val thr gly pro leu val ala ala pro ala asp asp ser 
541/181 571/191 

ccg ggc tgc agt ccg teg gac tac gac agg ctg ccg gtg tec ggt gcg gtg gtg ctg gta 
pro gly cys ser pro ser asp tyr asp arg leu pro val ser gly ala val val leu val 
601/201 631/211 

gat cgc ggc gtc tgt cct ttt gcc cag aag gaa gac gca gcc gcg cag cgc ggt gcg gtg 
asp arg gly val cys pro phe ala gin lys glu asp ala ala ala gin arg gly ala val 
661/221 691/231 

qcg ctg ate att get gac aac ate gac gag cag gcg atg ggc ggc acc ctg ggg get aat 
ala leu ile ile ala asp asn ile asp glu gin ala met gly gly thr leu gly ala asn 
721/241 751/251 

acc gac gtc aag ate ccg gtg gtg agt gtc acc aag teg gtc gga ttc cag eta cgc gga 
thr asp val lys ile pro val val ser val thr lys ser val gly phe gin leu arg gly 
781/261 811/271 

caq tct ggg cca acc acc gtc aag etc acg gcg age acc caa agt ttc aag gcc cgc aac 
gin ser gly pro thr thr val lys leu thr ala ser thr gin ser phe lys ala arg asn 

841/281 871/291 

qtc ate gcg cag acg aag acg ggg teg teg gcc aac gtg gtg atg gca ggt gcg cat ttg 
val ile ala gin thr lys thr gly ser ser ala asn val val met ala gly ala his leu 
901/301 931/311 

qac aqc gtt ccg gaa gga ccc ggc ate aac gac aac ggc teg gga gtg get gcg gtt ctg 
Tsp ser val pro glu giy pro gly ile asn asp asn gly ser gly val ala ala val leu 
961/321 991/331 

gaa acg gca gtg cag ctg ggg aac tea ccg cat gtg tee aac gcg gta egg ttc gcc ttc 
glu thr ala val gin leu gly asn ser pro his val ser asn ala val arg phe ala phe 

SEQ ID No. 19 F 
FIGURE 19F 
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1021/341 1051/351 

tag age gec gag gaa ttc ggc ctg att ggg tea cga aac tac gtc gag teg ctg gac ate 
trp qlv ala glu glu phe gly leu ile gly ser arg asn tyr val glu ser leu asp lie 
1081/361 1111/371 

aac geg etc aaa ggc ate geg ctg tat ctg aac ttc gac atg ttg gcg teg ccg aac ceg 
asn ala leu lys gly ile ala leu tyr leu asn phe asp met leu ala ser pro asn pro 
1141/381 1171/391 

gqt tac ttc acc tac gac ggt gac cag teg ctg ccg eta gac gec cgc ggt cag ccg gtg 
qiy tyr phe thr tyr asp gly asp gin ser leu pro leu asp ala arg gly gin pro val 
1201/401 1231/411 

gtg ccc gaa ggc teg gec ggt ate gag cgc acg ttc gtc gec tat ctg aag atg gee ggc 
val pro glu gly ser ala gly ile glu arg thr phe val ala tyr leu lys met ala gly 
1261/421 1291/431 

aag acc gcg cag gac acc teg ttc gac ggt egg tec gac tac gac ggc ttc acg ctg gcg 
lvs thr ala gin asp thr ser phe asp gly arg ser asp tyr asp gly phe thr leu ala 
1321/441 1351/451 

ggt ate cct teg ggt ggc ctg ttc tec ggc get gag gtc aag aag tec gee gag caa gee 
alv ile pro ser gly gly leu phe ser gly ala glu val lys lys ser ala glu gin ala 
1381/461 1411/471 

gag etc tgg ggc ggc acc gec gac gag cct ttc gat ccc aac tat cac cag aag aca gac 
alu leu trp gly gly thr ala asp glu pro phe asp pro' asn tyr his gin lys thr asp 
1441/481 1471/491 

acc ctg gac cat ate gac cgc acc gcg etc ggt ate aac ggc get ggc gtc gcg tac gcg 
thr leu asp his ile asp arg thr ala leu gly ile asn gly ala gly val ala -tyr ala 
1501/501 1531/511 

gtg ggt ttg tat gcg cag gac etc ggc ggc ccc aac ggg gtt ccg gtc atg gcg gac cgc 
val gly leu tyr ala gin asp leu gly gly pro asn gly val pro val met ala asp arg 
1561/521 

acc cgc cac ctg att gee aaa ccg tga 
thr arg his leu ile ala lys pro OPA 

SEQ ID No.l9F (continued) 
FIGURE 19F (continued) 



CGA GAC AGT GGT GCG 
arg asp ser gly ala 
61/21 

CGC GGT GTG GGA CTC 
arg gly val gly leu 
121/41 

GGC CGG CTG GCA ACC 
gly arg leu ala thr 
181/61 

TGA GCT CCC CGT TTT 
OPA ala pro arg phe 
241/81 

GCA CAT GGT GCC GGC 
ala his gly ala gly 
301/101 

TCG GTG CGT GCA TGC 
ser val arg ala cys 
361/121 

CCG ACA AAT CCG GAC 
pro thr asn pro asp 



GGA CAC TTG AGT TCG 
gly his leu ser ser 

ACG TTC GGT GAG GGT 
thr phe gly glu gly 

GTC GGT TTC GAC GTT 
val gly phe asp val 

GCT GGA TGC CCA GGC 
ala gly cys pro gly 

AGG GAG GAA CAG TGG 
arg glu glu gin trp 

TCG CAG CCG GAT GCA 
ser gin pro asp ala 

CAC TGC ATC AGG ATC 
his cys ile arg ile 

SEQ ID 



31/11 

GCT GCT AAC GAC GCC 
ala ala asn asp ala 
91/31 

ACA GCG GAC CTT CGA 
thr ala asp leu arg 
151/51 

GGT GAC GAC CCC TCG 
gly asp asp pro ser 
211/71 

ACC GCC GGT ACT GCT 
thr ala gly thr ala 
271/91 

GCA AGC AGC TAG CCG 
ala ser ser AMB pro 
331/111 

CCA ACG TGG TCG ACG 
pro thr trp ser thr 



NO.20A 



AGA GTC GCC CGC TTC 
arg val ala arg phe 

GCA CGC AAT ATC GTG 
ala arg asn ile val 

TTC ATG AAT CGT TCT 
phe met asn airg ser 

GCG CTT AAG CTT GTC 
ala leu lys leu val 

CGC TCG CCG CGC TGG 
arg ser pro arg trp 

GGA CCG CCG TGG CTG 
gly pro pro trp leu 



FIGURE 20A 
REPLACEMENT SHEET (RULE 26) 
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32/11 

GAG ACA GTG GTG CGG GAC ACT TGA GTT CGG CTG CTA ACG ACG CCA GAG TCG CCC GCT TCC 
qlu thr val val arg asp thr OPA val arg leu leu thr thr pro glu ser pro ala ser 
62/21 92 /31 

GCG GTG TGG GAC TCA CGT TCG GTG AGG GTA CAG CGG ACC TTC GAG CAC GCA ATA TCG TGG 
ala val trp asp ser arg ser val arg val gin arg thr phe glu his ala ile ser trp 
122/41 152/51 

GCC GGC TGG CAA CCG TCG GTT TCG ACG TTG GTG ACG ACC CCT CGT TCA TGA ATC GTT CTT 
ala qly trp gin pro ser val ser thr leu val thr thr pro arg ser OPA ile val leu 
182/61 212/71 

GAG CTC CCC GTT TTG CTG GAT GCC CAG GCA CCG CCG GTA CTG CTG CGC TTA AGC TTG TCG 
qlu leu pro val leu leu asp ala gin ala pro pro val leu leu arg leu ser leu ser 
242/81 272/91 

CAC ATG GTG CCG GCA GGG AGG AAC AGT GGG CAA GCA GCT AGC CGC GCT CGC CGC GCT GGT 
his met val pro ala gly arg asn ser gly gin ala ala ser arg ala arg arg ala gly 
302/101 332/111 

CGG TGC GTG CAT GCT CGC AGC CGG ATG CAC CAA CGT GGT CGA CGG GAC CGC CGT GGC TGC 
arg cys val his ala arg ser arg met his gin arg gly arg arg asp arg arg gly cys 
362/121 

CGA CAA ATC CGG ACC ACT GCA TCA GGA TC 
arg gin ile arg thr thr ala ser gly 



SEQ ID NO.20B 
FIGURE 20B 



AGA CAG TGG TGC GGG 
arg gin trp cys gly 
63/21 

CGG TGT GGG ACT CAC 
arg cys gly thr his 
123/41 

CCG GCT GGC AAC CGT 
pro ala gly asn arg 
183/61 

AGC TCC CCG TTT TGC 
ser ser pro phe cys 
243/81 

ACA TGG TGC CGG CAG 
thr trp cys arg gin 
303/101 

GGT GCG TGC ATG CTC 
gly ala cys met leu 
363/121 

GAC AAA TCC GGA CCA 
asp lys ser gly pro 



ACA CTT GAG TTC GGC 
thr leu glu phe gly 

GTT CGG TGA GGG TAC 
val arg OPA gly tyr 

CGG TTT CGA CGT TGG 
arg phe arg arg trp 

TGG ATG CCC AGG CAC 
trp met pro arg his 

GGA GGA ACA GTG GGC 
gly gly thr val gly 

GCA GCC GGA TGC ACC 
ala ala gly cys thr 

CTG CAT CAG GAT C 
leu his gin asp 



33/11 

TGC TAA CGA CGC CAG 
cys OCH arg arg gin 
93/31 

AGC GGA CCT TCG AGC 
ser gly pro ser ser 
153/51 

TGA CGA CCC CTC GTT 
OPA arg pro leu val 
213/71 

CGC CGG TAC TGC TGC 
arg arg tyr cys cys 
273/91 

AAG CAG CTA GCC GCG 
lys gin leu ala ala 
333/111 

AAC GTG GTC GAC GGG 
asn val val asp gly 



AGT CGC CCG CTT CCG 
ser arg pro leu pro 

ACG CAA TAT CGT GGG 
thr gin tyr arg gly 

CAT GAA TCG TTC TTG 
his glu ser phe leu 

GCT TAA GCT TGT CGC 
ala OCH ala cys arg 

CTC GCC GCG CTG GTC 
leu ala ala leu val 

ACC GCC GTG GCT GCC 
thr ala val ala ala 



SEQ ID NO.20C 
FIGURE 20C 
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part of 


the 


nucleotide sequence 


of seq20A 


















£_ 

1/1 


















31/11 


















TGT GGG 


ACT 


CAC 


GTT 


CGG 


TGA 


GGG 


TAC 


AGC 


GGA CCT 


TCG 


AGC 


ACG 


CAA 


TAT 


CGT 


GGG 


CCG 


cys gly 


thr 


his 


val 


arg 


OPA gly 


tyr 


ser 


gly pro 


ser 


ser 


thr 


gin 


tyr 


arg 


gly 


pro 


61/21 


















91/31 


















GCT GGC 


AAC 


CGT 


CGG 


TTT 


CGA 


CGT 


TGG 


TGA 


CGA CCC 


CTC 


GTT 


CAT 


GAA 


TCG 


TTC 


TTG 


AGC 


ala gly 


asn 


arg 


arg 


phe 


arg 


arg 


trp 


OPA 


arg pro 


leu 


val 


his 


glu 


ser 


phe 


leu 


ser 


121/41 


















151/51 


















TCC CCG 


TTT 


TGC 


TGG 


ATG 


CCC 


AGG 


CAC 


CGC 


CGG TAC 


TGC 


TGC 


GCT 


TAA 


GCT 


TGT 


CGC 


ACA 


ser pro 


phe 


cys 


trp 


met 


pro 


arg 


his 


arg 


arg tyr 


cys 


cys 


ala 


OCH 


ala 


cys 


arg 


thr 


181/61 
















211/71 


















TGG TGC 


CGG 


CAG 


GGA 


GGA 


ACA 


GTG 


GGC 


AAG 


CAG CTA 


GCC 


GCG 


CTC 


GCC 


GCG 


CTG 


GTC 


GGT 


trp cys 


arg 


gin 


gly 


gly 


thr 


val 


gly 


lys 


gin leu 


ala 


ala 


leu 


ala 


ala 


leu 


val 


gly 


241/81 


















271/91 


















GCG TGC 


ATG 


CTC 


GCA 


GCC 


GGA 


TGC 


ACC 


AAC 


GTG GTC 


GAC 


GGG 


ACC 


GCC 


GTG 


GCT 


GCC 


GAC 


ala cys 


met 


leu 


ala 


ala 


gly 


cys 


thr 


asn 


val val 


asp 


gly 


thr 


ala 


val 


ala 


ala 


asp 


301/101 




































AAA TCC 


GGA 


CCA 


CTG 


CAT 


CAG 


GAT 


C 






















lys ser 


gly 


pro 


leu 


his 


gin 


asp 

























SEQ ID No.20A f 
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1/1 

GTG GGA CTC ACG TTC GGT GAG 
val gly leu -thr phe gly glu 
61/21 

CTG GCA ACC GTC GGT TTC GAC 
leu ala thr val gly phe asp 
121/41 

CCC CGT TTT GCT GGA TGC CCA 
pro arg phe ala gly cys pro 
181/61 

GGT GCC GGC AGG GAG GAA CAG 
gly ala gly arg glu glu gin 
241/81 

CGT GCA TGC TCG CAG CCG GAT 
arg ala cys ser gin pro asp 
301/101 

AAT CCG GAC CAC TGC ATC AGG 
asn pro asp his cys ile arg 









31/11 




GGT 


ACA 


GCG 


GAC CTT 


CGA 


gly 


thr 


ala 


asp leu 


arg 








91/31 




GTT 


GGT 


GAC 


GAC CCC 


TCG 


val 


gly 


asp 


asp pro 


ser 








151/51 




GGC 


ACC 


GCC 


GGT ACT 


GCT 


gly 


thr 


ala 


gly thr 


ala 








211/71 




TGG 


GCA 


AGC 


AGC TAG 


CCG 


trp 


ala 


ser 


ser AMB 


pro 








271/91 




GCA 


CCA 


ACG 


TGG TCG 


ACG 


ala 


pro 


thr 


trp ser 


thr 


ATC 










ile 












SEQ 


ID 


NO.20B 


i 



FIGURE 20B* 



GCA CGC AAT ATC GTG GGC CGG 
ala arg asn ile val gly arg 

TTC ATG AAT CGT TCT TGA GCT 
phe met asn arg ser OPA ala 

GCG CTT AAG CTT GTC GCA CAT 
ala leu lys leu val ala his 

CGC TCG CCG CGC TGG TCG GTG 
arg ser pro arg trp ser val 

GGA CCG CCG TGG CTG CCG ACA 
gly pro pro trp leu pro thr 



REPLACEMENT SHEET (RULE 26) 



76/185 



1/1 








GTG TGG 


GAC 


TCA 


t^Lr 1 


val trp 


asp 


ser 


arg 


61/2 1 








GGC TGG 








gly trp 


gin 


pro 


ser 


i ?i/4l 

1.4. 1. f *1 J- 








CTC CCC 


GTT 


TTG 


CTG 


leu pro 


val 


leu 


leu 


181/61 








ATG GTG 


CCG 


GCA 


GGG 


met val 


pro 


ala 


gly 


241/81 








TGC GTG 


CAT 


GCT 


CGC 


cys val 


his 


ala 


arg 


301/101 








CAA ATC 


CGG 


ACC 


ACT 


gin ile 


arg 


thr 


thr 



TCG GTG AGG GTA CAG 
ser val arg val gin 

GTT TCG ACG TTG GTG 
val ser thr leu val 

GAT GCC CAG GCA CCG 
asp ala gin ala pro 

AGG AAC AGT GGG CAA 
arg asn ser gly gin 

AGC CGG ATG CAC CAA 
ser arg met his gin 

GCA TCA GGA TC 
ala ser gly 

SEQ 



31/11 








CGG ACC 


TTC 


GAG 


CAC 


arg thr 


phe 


glu 


his 


91/31 








ACG ACC 


CCT 


CGT 


TCA 


thr thr 


pro 


arg 


ser 


151/51 








CCG GTA 


CTG 


CTG 


CGC 


pro val 


leu 


leu 


arg 


211/71 








GCA GCT 


AGC 


CGC 


GCT 


ala ala 


ser 


arg 


ala 


271/91 








CGT GGT 


CGA 


CGG 


GAC 


arg gly 


arg 


arg 


asp 



ID No.20C 



GCA ATA TCG TGG GCC 
ala ile ser trp ala 

TGA ATC GTT CTT GAG 
OPA ile val leu glu 

TTA AGC TTG TCG CAC 
leu ser leu ser his 

CGC CGC GCT GGT CGG 
arg arg ala gly arg 

CGC CGT GGC TGC CGA 
arg arg gly cys arg 



FIGURE 20C 



sequence Rv3576 predicted by Cole et al. (Nature 393:537-544) and containing seq20A' 
1/1 31/11 

atg ggc aag cag eta gec gcg etc gec gcg ctg gtc ggt gcg tgc atg etc gca gec gga 
met gly lys gin leu ala ala leu ala ala leu val gly ala cys met leu ala ala gly 
61/21 91/31 

tgc acc aac gtg gtc gac ggg acc gec gtg get gec gac aaa tec gga cca ctg cat cag 
cys thr asn val val asp gly thr ala val ala ala asp lys ser gly pro leu his gin 

gat ccg ata ccg gtt tea gcg ctt gaa ggg ctg ctt etc gac ttg age cag ate aat gee 
asp pro ile pro val ser ala leu glu gly leu leu leu asp leu ser gin ile asn ala 
181/61 211/71 

gcg ctg ggt gcg aca teg atg aag gtg tgg ttc aac gee aag gca atg tgg gac tgg age 
ala leu gly ala thr ser met lys val trp phe asn ala lys ala met trp asp trp ser 
241/81 271/91 

aag age gtg gee gac aag aat tgc ctg get ate gac ggt cca gca cag gaa aag gtc tat 
lvs ser val ala asp lys asn cys leu ala ile asp gly pro ala gin glu lys val tyr 
301/101 331/111 

gee ggc acc ggg tgg acc get atg cgc ggc caa egg ctg gat gac age ate gat gac tec 
ala gly thr gly trp thr ala met arg gly gin arg leu asp asp ser lie asp asp ser 
361/121 391/131 

aag aaa cgc gac cac tac gec att caa gcg gtc gtc ggc ttc ccg acc gca cat gat gee 
lvs lys arg asp his tyr ala ile gin ala val val gly phe pro thr ala his asp ala 
421/141 451/151 

gag gag ttc tac age tec teg gtg caa age tgg age age tgc teg aac cgc egg ttt gtc 
qlu glu phe tyr ser ser ser val gin ser trp ser ser cys ser asn arg arg phe val 
481/161 511/171 

gaa gtc acc ccc gga cag gac gac gee gee tgg act gtg get gac gtt gtc aac gac aac 
qlu val thr pro gly gin asp asp ala ala trp thr val ala asp val val asn asp asn 
541/181 571/191 

ggc atg etc agt age teg cag gtt cag gaa ggc ggc gac gga tgg acc tgc cag cgt gec 
gly met leu ser ser ser gin val gin glu gly gly asp gly trp thr cys gin arg ala 
601/201 631/211 

eta act gcg cgc aac aac gtc act ate gac att gtc acg tgc gec tat age caa ccg gat 
leu thr ala arg asn asn val thr ile asp ile val thr cys ala tyr ser gin pro asp 
661/221 691/231 

ttg gtg gcg att ggc ate get aac caa ate gcg gec aag gtt get aag cag tag 
leu val ala ile gly ile ala asn gin ile ala ala lys val ala lys gin AMB 

SEQ ID No.20D 



FIGURE 20D 



REPLACEMENT SHEET (RULE 26) 



77/185 



ORF according to Cole et al . (Nature 393:537-544) and containing Rv3576 
1/1 31 ' 1:L 

taa get tgt cgc aca tgg tgc egg cag gga gga aca gtg ggc aag cag eta gec gcg etc 
OCH ala cys arg thr trp cys arg gin gly gly thr val gly lys gin leu ala ala leu 
61/21 91/31 

gec gcg ctg gtc ggt gcg tgc atg etc gca gec gga tgc ace aac gtg gtc gac ggg acc 
ala ala leu val gly ala cys met leu ala ala gly cys thr asn val val asp gly thr 
121/41 151/51 

gee gtg get gec gac aaa tec gga cca ctg cat cag gat ccg ata ccg gtt tea gcg ctt 
ala val ala ala asp lys ser gly pro leu his gin asp pro ile pro val ser ala leu 
181/61 211/71 

gaa ggg ctg ctt etc gac ttg age cag ate aat gec gcg ctg ggt gcg aca teg atg aag 
glu gly leu leu leu asp leu ser gin ile asn ala ala leu gly ala thr ser met lys 
241/81 271/91 

gtg tgg ttc aac gec aag gca atg tgg gac tgg age aag age gtg gee gac aag aat tgc 
val trp phe asn ala lys ala met trp asp trp ser lys ser val ala asp lys asn cys 
301/101 331/111 

ctg get ate gac ggt cca gca cag gaa aag gtc tat gee ggc acc ggg tgg acc get atg 
leu ala ile asp gly pro ala gin glu lys val tyr ala gly thr gly trp thr ala met 
361/121 391/131 

cgc ggc caa egg ctg gat gac age ate gat gac tec aag aaa cgc gac cac tac gec att 
arq qly gin arg leu asp asp ser ile asp asp ser lys lys arg asp his tyr ala lie 
421/141 451/151 

caa gcg gtc gtc ggc ttc ccg acc gca cat gat gee gag gag ttc tac age tec teg gtg 
gin ala val val gly phe pro thr ala his asp ala glu glu phe tyr ser ser ser val 
481/161 511/171 

caa age tgg age age tgc teg aac cgc egg ttt gtc gaa gtc acc ccc gga cag gac gac 
qln ser trp ser ser cys ser asn arg arg phe val glu val thr pro gly gin asp asp 
541/181 571/191 

gee gee tgg act gtg get gac gtt gtc aac gac aac ggc atg etc agt age teg cag gtt 
ala ala trp thr val ala asp val val asn asp asn gly met leu ser ser ser gin val 
601/201 631/211 

cag gaa ggc ggc gac gga tgg acc tgc cag cgt gec ctg act gcg cgc aac aac gtc act 
qln qlu gly gly asp gly trp thr cys gin arg ala leu thr ala arg asn asn val thr 
661/221 691/231 

ate gac att gtc acg tgc gee tat age caa ccg gat ttg gtg gcg att ggc ate get aac 
ile asp ile val thr cys ala tyr ser gin pro asp leu val ala ile gly ile ala asn 
721/241 

caa ate gcg gee aag gtt get aag cag tag 
gin ile ala ala lys val ala lys gin AMB 



SEQ ID NO.20F 
FIGURE 20F 



REPLACEMENT SHEET (RULE 26) 



# 



78/185 



1/1 

GTC CTG GTC GCC GCG 
val leu val ala ala 
61/21 

CGC TGG TAG TGG CGA 
arg trp AMB trp arg 
121/41 

GCG GCC TGC GGC TGG 
ala ala cys gly trp 
181/61 

CGG CGC TGG ACG TCG 
arg arg trp thr ser 
241/81 

AAA ACT TCA CCG CCC 
lys thr ser pro pro 
301/101 

CAG ACG TGC GGT CGG 
gin thr cys gly arg 
361/121 

TGC TGG CCG ACA GCA 
cys trp pro thr ala 



CAA 


CTG 


GCC 


GGT 


CCC 


gin 


leu 


ala 


gly 


pro 


TGG 


TGT 


TAG 


CAG 


GAT 


trp 


cys 


AMB 


gin 


asp 


TCG 


CCG 


CGC 


GCG 


CCG 


ser 


pro 


arg 


ala 


pro 


CCG 


TGC 


TGG 


CCA 


GCT 


pro 


cys 


trp 


pro 


ala 


GCA 


AGT 


ACG 


AGC 


TGC 


ala 


ser 


thr 


ser 


cys 


GAG 


TGA 


ACA 


CGC 


TGC 


glu 


OPA 


thr 


arg 


cys 


TCG 


GCT 


TGC 


GGG 


ATC 


ser 


ala 


cys 


gly 


ile 



31/11 

GAT GGA AAG TGT TCA 
asp gly lys cys ser 
91/31 

TGC GGG TCG AGG CTG 
cys gly ser arg leu 
151/51 

AAA TGA TAC CCG CGA 
lys OPA tyr pro arg 
211/71 

CGA CCG GAC ACG ATG 
arg pro asp thr met 
271/91 

AGA CGC GAC TGG CCG 
arg arg asp trp pro 
331/111 

TCA ACG GCG GTC AGG 
ser thr ala val arg 



CGA TCG CGC TTC TGC 
arg ser arg phe cys 

CGA TGG CCA GCA CCA 
arg trp pro ala pro 

TCA CGA AAT ACA TGT 
ser arg asn thr cys 

TGG AGG GGG CGC AGA 
trp arg gly arg arg 

ACA CCG ACG TCA TCG 
thr pro thr ser ser 

CGC TGC TGG ATA AGA 
arg cys trp ile arg 



TCC TGG TCG CCG CGC AAC TGG 
ser trp ser pro arg asn trp 
62/21 

GCT GGT AGT GGC GAT GGT GTT 
ala gly ser gly asp gly val 
~ 122/41 

CGG CCT GCG GCT GGT CGC CGC 
arg pro ala ala gly arg arg 
182/61 

GGC GCT GGA CGT CGC CGT GCT 
gly ala gly arg arg arg ala 
242/81 

AAA CTT CAC CGC CCG CAA GTA 
lys leu his arg pro gin val 
302/101 

AGA CGT GCG GTC GGG AGT GAA 
arg arg ala val gly ser glu 
362/121 

GCT GGC CGA CAG CAT CGG CTT 
ala gly arg gin his arg leu 



SEQ ID No.21A 
FIGURE 21A 

32/11 



CCG 


GTC 


CCG 


ATG GAA 


AGT 


pro 


val 


pro 


met glu 


ser 








92/31 




AGC 


AGG 


ATT 


GCG GGT 


CGA 


ser 


arg 


ile 


ala gly 


arg 








152/51 




GCG 


CGC 


CGA 


AAT GAT 


ACC 


ala 


arg 


arg 


asn asp 


thr 








212/71 




GGC 


CAG 


CTC 


GAC CGG 


ACA 


gly 


gin 


leu 


asp arg 


thr 








272/91 




CGA 


GCT 


GCA 


GAC GCG 


ACT 


arg 


ala 


ala 


asp ala 


thr 








332/111 




CAC 


GCT 


GCT 


CAA CGG 


CGG 


his 


ala 


ala 


gin arg 


arg 


GCG 


GGA 


TC 






ala 


gly 









SEQ ID No .2 IB 



FIGURE 21B 



GTT CAC GAT CGC GCT TCT GCC 
val his asp arg ala ser ala 

GGC TGC GAT GGC CAG CAC CAG 
gly cys asp gly gin his gin 

CGC GAT CAC GAA ATA CAT GTC 
arg asp his glu ile his val 

CGA TGT GGA GGG GGC GCA GAA 
arg cys gly gly gly ala glu 

GGC CGA CAC CGA CGT CAT CGC 
gly arg his arg arg his arg 

TCA GGC GCT GCT GGA TAA GAT 
ser gly ala ala gly OCH asp 



REPLACEMENT SHEET (RULE 26) 



* 

79/185 



33/11 



CCT GGT 


CGC 


CGC 


GCA 


ACT 


GGC 


CGG 


TCC 


CGA 


TGG AAA 


GTG 


TTC 


ACG 


ATC 


GCG 


CTT 


CTG 


CCG 


pro gly 


arg 


arg 


ala 


thr 


gly 


arg 


ser 


arg 


trp lys 


val 


phe 


thr 


ile 


ala 


leu 


leu 


pro 


63/21 


















93/31 


















CTG GTA 


GTG 


GCG 


ATG 


GTG 


TTA 


GCA 


GGA 


TTG 


CGG GTC 


GAG 


GCT 


GCG 


ATG 


GCC 


AGC 


ACC 


AGC 


leu val 


val 


ala 


met 


val 


leu 


ala 


gly 


leu 


arg val 


glu 


ala 


ala 


met 


ala 


ser 


thr 


ser 


123/41 


















153/51 


















GGC CTG 


CGG 


CTG 


GTC 


GCC 


GCG 


CGC 


GCC 


GAA 


ATG ATA 


CCC 


GCG 


ATC 


ACG 


AAA 


TAC 


ATG 


TCG 


gly leu 


arg 


leu 


val 


ala 


ala 


arg 


ala 


glu 


met ile 


pro 


ala 


ile 


thr 


lys 


tyr 


met 


ser 


183/61 


















213/71 


















GCG CTG 


GAC 


GTC 


GCC 


GTG 


CTG 


GCC 


AGC 


TCG 


ACC GGA 


CAC 


GAT 


GTG 


GAG 


GGG 


GCG 


CAG 


AAA 


ala leu 


asp 


val 


ala 


val 


leu 


ala 


ser 


ser 


thr gly 


his 


asp 


val 


glu 


gly 


ala 


gin 


lys 


243/81 


















273/91 


















AAC TTC 


ACC 


GCC 


CGC 


AAG 


TAC 


GAG 


CTG 


CAG 


ACG CGA 


CTG 


GCC 


GAC 


ACC 


GAC 


GTC 


ATC 


GCA 


asn phe 


thr 


ala 


arg 


lys 


tyr 


glu 


leu 


gin 


thr arg 


leu 


ala 


asp 


thr 


asp 


val 


ile 


ala 


303/101 


















333/111 


















GAC GTG 


CGG 


TCG 


GGA 


GTG 


AAC 


ACG 


CTG 


CTC 


AAC GGC 


GGT 


CAG 


GCG 


CTG 


CTG 


GAT 


AAG 


ATG 


asp val 


arg 


ser 


gly 


val 


asn 


thr 


leu 


leu 


asn gly 


gly 


gin 


ala 


leu 


leu 


asp 


lys 


met 


363/121 




































CTG GCC 


GAC 


AGC 


ATC 


GGC 


TTG 


CGG 


GAT 


C 




















leu ala 


asp 


ser 


ile 


gly 


leu 


arg 


asp 























SEQ ID No.21C 
FIGURE 21C 



part of the nucleotide sequence of seq21A 



1/1 31/11 



ACG ATC 


GCG 


CTT 


CTG 


CCG 


CTG 


GTA 


GTG 


GCG 


ATG GTG 


TTA 


GCA 


GGA 


TTG 


CGG 


GTC 


GAG 


GCT 


thr ile 


ala 


leu 


leu 


pro 


leu 


val 


val 


ala 


met val 


leu 


ala 


gly 


leu 


arg 


val 


glu 


ala 


61/21 


















91/31 


















GCG ATG 


GCC 


AGC 


ACC 


AGC 


GGC 


CTG 


CGG 


CTG 


GTC GCC 


GCG 


CGC 


GCC 


GAA 


ATG 


ATA 


CCC 


GCG 


ala met 


ala 


ser 


thr 


ser 


gly 


leu 


arg 


leu 


val ala 


ala 


arg 


ala 


glu 


met 


ile 


pro 


ala 


121/41 


















151/51 


















ATC ACG 


AAA 


TAC 


ATG 


TCG 


GCG 


CTG 


GAC 


GTC 


GCC GTG 


CTG 


GCC 


AGC 


TCG 


ACC 


GGA 


CAC 


GAT 


ile thr 


lys 


tyr 


met 


ser 


ala 


leu 


asp 


val 


ala val 


leu 


ala 


ser 


ser 


thr 


gly 


his 


asp 


181/61 


















211/71 


















GTG GAG 


GGG 


GCG 


CAG 


AAA 


AAC 


TTC 


ACC 


GCC 


CGC AAG 


TAC 


GAG 


CTG 


CAG 


ACG 


CGA 


CTG 


GCC 


val glu 


gly 


ala 


gin 


lys 


asn 


phe 


thr 


ala 


arg lys 


tyr 


glu 


leu 


gin 


thr 


arg 


leu 


ala 


241/81 


















271/91 


















GAC ACC 


GAC 


GTC 


ATC 


GCA 


GAC 


GTG 


CGG 


TCG 


GGA GTG 


AAC 


ACG 


CTG 


CTC 


AAC 


GGC 


GGT 


CAG 


asp thr 


asp 


val 


ile 


ala 


asp 


val 


arg 


ser 


gly val 


asn 


thr 


leu 


leu 


asn 


gly 


gly 


gin 


301/101 
















331/111 


















GCG CTG 


CTG 


GAT 


AAG 


ATG 


CTG 


GCC 


GAC 


AGC 


ATC GGC 


TTG 


CGG 


GAT 


C 










ala leu 


leu 


asp 


lys 


met 


leu 


ala 


asp 


ser 


ile gly 


leu 


arg 


asp 













SEQ ID No.21A f 
FIGURE 21A f 



REPLACEMENT SHEET (RULE 26) 



sequence Rv3365c predicted 
Seq21A' ' 

1/1 

gtg acc atg ttc gcc cgc ccg 
val thr met phe ala arg pro 
61/21 

ccg get caa ccg gcc cgc ggc 
pro ala gin pro ala arg gly 
121/41 

tgg ccg gtc cga tgg aaa gtg 
trp pro val arg trp lys val 
181/61 

tta gca gga ttg egg gtc gag 
leu ala gly leu arg val glu 
241/81 

gcg cgc gcc gaa atg ata ccc 
ala arg ala glu met ile pro 
301/101 

ctg gcc age teg acc gga cac 
leu ala ser ser thr gly his 
361/121 

tac gag ctg cag acg cga ctg 
tyr glu leu gin thr arg leu 
421/141 

aac acg ctg etc aac ggc ggt 
asn thr leu leu asn gly gly 
481/161 

ttg egg gat egg gtc acc gcc 
leu arg asp arg val thr ala 
541/181 

gcg teg gtg egg gtt gac age 
ala ser val arg val asp ser 
601/201 

gtt ggc gcc cgc ggg cag atg 
val gly ala arg gly gin met 
661/221 

gcc gag ccg caa ctg cgc age 
ala glu pro gin leu arg ser 
721/241 

ttc ggg atg age gcg gcg etc 
phe gly met ser ala ala leu 
781/261 

atg gtg acc agg atg gcg ate 
met val thr arg met ala ile 
841/281 

ctg ctg cac teg ata cag ate 
leu leu his ser ile gin ile 
901/301 

gag gcg gtg acg aag teg gtg 
glu ala val thr lys ser val 
961/321 

gac gcc gtg ctg gtg ttg gcc 
asp ala val leu val leu ala 



81/185 



by Cole et al . (Nature 



31/11 

acc ate ccg gtc gcg gcg 
thr ile pro val ala ala 
91/31 

aaa cct cag caa cgc ccg 
lys pro gin gin arg pro 
151/51 

ttc acg ate gcg ctt ctg 
phe thr ile ala leu leu 

211/71 
get gcg atg gcc age acc 
ala ala met ala ser thr 

271/91 
gcg ate acg aaa tac atg 
ala ile thr lys tyr met 

331/111 
gat gtg 'gag ggg gcg cag 
asp val glu gly ala gin 

391/131 
gcc gac acc gac gtc ate 
ala asp thr asp val ile 

451/151 
cag gcg ctg ctg gat aag 
gin ala leu leu asp lys 

511/171 
tac gcg ccg ctg ctg ttg 
tyr ala pro leu leu leu 

571/191 
gag caa ate cga acc cag 
glu gin ile arg thr gin 

631/211 
acg atg cag gag ate ctg 
thr met gin glu ile leu 

691/231 
gcg atg gtt acc ctg gcc 
ala met val thr leu ala 

751/251 
ggt gca ggc teg ccg gac 
gly ala gly ser pro asp 

811/271 
atg tec gat ccg gcc gtt 
met ser asp pro ala val 

871/291 
acc cgc gac att gcc gag 
thr arg asp ile ala glu 

931/311 
caa age cag gcc acc gac 
gin ser gin ala thr asp 

991/331 
gcc ate gcg acc gcg ate 
ala ile ala thr ala ile 



SEQ ID No.21F 
FIGURE 21D 



393:537-544) and containing 



gcc get tct gat att tec gcc 
ala ala ser asp ile ser ala 

ccg tec tgg teg ccg cgc aac 
pro ser trp ser pro arg asn 

ccg ctg gta gtg gcg atg gtg 
pro leu val val ala. met val 

age ggc ctg egg ctg gtc gcc 
ser gly leu arg leu val ala 

teg gcg ctg gac gtc gcc gtg 
ser ala leu asp val ala val 

aaa aac ttc acc gcc cgc aag 
lys asn phe thr ala arg lys 

gca gac gtg egg teg gga gtg 
ala asp val arg ser gly val 

gtg ctg gcc gac age ate ggc 
val leu ala asp ser ile gly 

acg gcc cag aac gtg att gac 
thr ala gin asn val ile asp 

gtg cag ggt ttg age cga gcc 
val gin gly leu ser arg ala 

gtg act cgc ggc gcc gac ctt 
val thr arg gly ala asp leu 

ggc acc gaa ccc teg acg ctg 
gly thr glu pro ser thr leu 

acc aag aac ctg cag cag caa 
thr lys asn leu gin gin gin 

gca ctg gtc aac aac cca gag 
ala leu val asn asn pro glu 

cag gtg ate acc gac acc acc 
gin val ile thr asp thr thr 

egg egg gat gcc gcg att cgc 
arg arg asp ala ala ile arg 

gtc gtc gtg ttg gtg gtg gcg 
val val val leu val val ala 



REPLACEMENT SHEET (RULE 26) 



0 

86/185 



1861/621 

gtc ggg ctg cgc ggt 
val gly leu arg gly 
1921/641 

ctg ccg eta gec gtg 
leu pro leu ala val 
1981/661 

gcg ate aag ccg ccg 
ala ile lys pro pro 
2041/681 

ate ggg ccg eta cca 
ile gly pro leu pro 
2101/701 

gee gac gtc ccg gee 
ala asp val pro ala 
2161/721 

gag gat agg ttt caa 
glu asp arg phe gin 
2221/741 

ccg ccc gec aaa ccc 
pro pro ala lys pro 
2281/761 

atg etc tec gag atg 
met leu ser glu met 
2341/781 

aag teg gtg tgg gac 
lys ser val trp asp 
2401/801 

tec cgc acg gac tac 
ser arg thr asp tyr 
2461/821 

gcg gtg cct gag gga 
ala val pro glu gly 
2521/841 

ctt cat ccc ggc cga 
leu his pro gly arg 
2581/861 

gee tec ate age age 
ala ser ile ser ser 
2641/881 

agt cag gga ccc aat 
ser gin gly pro asn 



ccg gtg ace ggt gaa 
pro val thr gly glu 

etc gag ggg acg gec 
leu glu gly thr ala 

tgt cct gaa ccc gcg 
cys pro glu pro ala 

ccg gtc acg ttg etc 
pro val thr leu leu 

cag ccg atg cag cag 
gin pro met gin gin 

cag gag ccc aaa caa 
gin glu pro lys gin 

gcg cca ccg gcg ggc 
ala pro pro ala gly 

gtg ggt gac ccg cac 
val gly asp pro his 

cac ggc tgg teg gcg 
his gly trp ser ala 

ggc ctg ccg gtg cgc 
gly leu pro val arg 

ccc gat egg gag cat 
pro asp arg glu his 

gcg ccg egg cac gcg 
ala pro arg his ala 

cat ttc ggc ggc gtg 
his phe gly gly val 

cag caa tga 
gin gin OPA 



1891/631 

cag ggc ace ggc ace 
gin gly thr gly thr 
1951/651 

cca gcg cag ccg cca 
pro ala gin pro pro 
2011/671 

gcg gee gat ccg acg 
ala ala asp pro thr 
2071/691 

ccg cgc cgt acc ccg 
pro arg arg thr pro 
2131/711 

egg egg cgc gag ctg 
arg arg arg glu leu 
2191/731 

ccg ccc gca . cca gaa 
pro pro ala pro glu 
2251/751 

ccg gtt gat gac gac 
pro val asp asp asp 
2311/771 

gag ctg gee cac age 
glu leu ala his ser 
2371/791 

gec gee gag gee gcg 
ala ala glu ala ala 
2431/811 

gaa ccc ggg gee egg 
glu pro gly ala arg 
2491/831 

ccg ggt gca gcg eta 
pro gly ala ala leu 
2551/851 

get gcg gta cgc gac 
ala ala val arg asp 
2611/871 

cgc acc ggg egg teg 
arg thr gly arg ser 



acc gee gag gtc tac 
thr ala glu val tyr 

aag ccg egg gta ttt 
lys pro arg val phe 

gac gtt ccc gee gee 
asp val pro ala ala 

ggg tec agt ggc ate 
gly ser ser gly ile 

aaa aca ccc tgg tgg 
lys thr pro trp trp 

ccg cga ccg gcg ccg 
pro arg pro ala pro 

gtc ate tac egg egg 
val ile tyr arg arg 

ccc gat ctg gac tgg 
pro asp leu asp trp 

gac aag ccc gtg cag 
asp lys pro val gin 

tta gtg ccg ggg gcg 
leu val pro gly ala 

gca tec aac ggc gga 
ala ser asn gly gly 

ccc gac gcg gtt cgt 
pro asp ala val arg 

cat gec cgc gag age 
his ala arg glu ser 



SEQ ID No.21F (continued 2) 
FIGURE 2 IF (continued 2) 



REPLACEMENT SHEET (RULE 26) 



87/185 



CTA CGA CAA GGC AAA GGA GCA CAG GGT 
leu arg gin gly lys gly ala gin gly 
61/21 

CGC CAT TCT GGT CGC AGG TCT TTC CGG 
arg his ser gly arg arg ser phe arg 
121/41 

TGA GAC CAC GAC CGC GGC AGG CAC GAC 
OPA asp his asp arg gly arg his asp 
181/61 

CGT CAT CGA CGG TAA GGA CCA GAA CGT 
arg his arg arg OCH gly pro glu arg 
241/81 

CAA TGT CAA CAT CGC GAT CGG CGG GGC 
gin cys gin his arg asp arg arg gly 
301/101 

CAA CCC TCC GGA GGT GAA GTC CGT TGG 
gin pro ser gly gly glu val arg trp 
361/121 

CAC GTC GGG CAC CGG ACA GGG TAA CGC 
his val gly his arg thr gly OCH arg 



31/11 

GAA GCG TGG ACT GAC GGT CGC GGT AGC CGG AGC 
glu ala trp thr asp gly arg gly ser arg ser 
91/31 

ATG TTC AAG CAA CAA GTC GAC TAC AGG AAG CGG 
met phe lys gin gin val asp tyr arg lys arg 
151/51 

GGC AAG CCC CGG CGC CGC ATC CGG GCC GAA GGT 
gly lys pro arg arg arg ile arg ala glu gly 
211/71 

CAC CGG GTC TGT GGT GTG CAC AAC CGC GGC CGG 
his arg val cys gly val his asn arg gly arg 
271/91 

GGC GAC CGG CAT TGC CGC CGT GCT CAC CGA CGG 
gly asp arg his cys arg arg ala his arg arg 
331/111 

GCT CGG TAA CGT CAA CGG CGT CAC GCT GGG ATA 
ala arg OCH arg gin arg arg his ala gly ile 
391/131 

TCG GCA ACC AAG GAC GGC AGC CAC TAC AAG ATC 
ser ala thr lys asp gly ser his tyr lys ile 



SEQ ID No.22A 



FIGURE 22A 



TAC GAC AAG GCA AAG GAG CAC AGG GTG AAG 
tyr asp lys ala lys glu his arg val lys 
62/21 

GCC ATT CTG GTC GCA GGT CTT TCC GGA TGT 
ala ile leu val ala gly leu ser gly cys 
122/41 

GAG ACC ACG ACC GCG GCA GGC ACG ACG GCA 
glu thr thr thr ala ala gly thr thr ala 
182/61 

GTC ATC GAC GGT AAG GAC CAG AAC GTC ACC 
val ile asp gly lys asp gin asn val thr 
242/81 

AAT GTC AAC ATC GCG ATC GGC GGG GCG GCG 
asn val asn ile ala ile gly gly ala ala 
302/101 

AAC CCT CCG GAG GTG AAG TCC GTT GGG CTC 
asn pro pro glu val lys ser val gly leu 
362/121 

ACG TCG GGC ACC GGA CAG GGT AAC GCT CGG 
thr ser gly thr gly gin gly asn ala arg 



32/11 

CGT GGA CTG ACG GTC GCG GTA GCC GGA GCC 
arg gly leu thr val ala val ala gly ala 
92/31 

TCA AGC AAC AAG TCG ACT ACA GGA AGC GGT 
ser ser asn lys ser thr thr gly ser gly 
152/51 

AGC CCC GGC GCC GCA TCC GGG CCG AAG GTC 
ser pro gly ala ala ser gly pro lys val 
212/71 

GGG TCT GTG GTG TGC ACA ACC GCG GCC GGC 
gly ser val val cys thr thr ala ala gly 
272/91 

ACC GGC ATT GCC GCC GTG CTC ACC GAC GGC 
thr gly ile ala ala val leu thr asp gly 
332/111 

GGT AAC GTC AAC GGC GTC ACG CTG GGA TAC 
gly asn val asn gly val thr leu gly tyr 
392/131 

CAA CCA AGG ACG GCA GCC ACT ACA AGA TC 
gin pro arg thr ala ala thr thr arg 



SEQ ID No.22B 
FIGURE 22B 



REPLACEMENT SHEET (RULE 26) 



ACG ACA AGG CAA AGG AGC ACA 
thr thr arg gin arg ser thr 
63/21 

CCA TTC TGG TCG CAG GTC TTT 
pro phe trp ser gin val phe 
123/41 

AGA CCA CGA CCG CGG CAG GCA 
arg pro arg pro arg gin ala 
183/61 

TCA TCG ACG GTA AGG ACC AGA 
ser ser thr val arg thr arg 
243/81 

ATG TCA ACA TCG CGA TCG GCG 
met ser thr ser arg ser ala 
303/101 

ACC CTC CGG AGG TGA AGT CCG 
thr leu arg arg OPA ser pro 
363/121 

CGT CGG GCA CCG GAC AGG GTA 
arg arg ala pro asp arg val 



88/185 

33/11 



GGG 


TGA 


AGC 


GTG GAC 


TGA 


CGG 


gly 


OPA 


ser 


val asp 


OPA 


arg 








93/31 






CCG 


GAT 


GTT 


CAA GCA 


ACA 


AGT 


pro 


asp 


val 


gin ala 


thr 


ser 






153/51 






CGA 


CGG 


CAA 


GCC CCG 


GCG 


CCG 


arg 


arg 


gin 


ala pro 


ala 


pro 








213/71 






ACG 


TCA 


CCG 


GGT CTG 


TGG 


TGT 


thr 


ser 


pro 


gly leu 


trp 


cys 








273/91 






GGG 


CGG 


CGA 


CCG GCA 


TTG 


CCG 


gly 


arg 


arg 


pro ala 


leu 


pro 








333/111 






TTG 


GGC 


TCG 


GTA ACG 


TCA 


ACG 


leu 


gly 


ser 


val thr 


ser 


thr 








393/131 






ACG 


CTC 


GGC 


AAC CAA 


GGA 


CGG 


thr 


leu 


gly 


asn gin 


gly 


arg 



SEQ ID No.22C 



FIGURE 22C 



TCG CGG TAG CCG GAG CCG 
ser arg AMB pro glu pro 

CGA CTA CAG GAA GCG GTG 
arg leu gin glu ala val 

CAT CCG GGC CGA AGG TCG 
his pro gly arg arg ser 

GCA. CAA CCG CGG CCG GCA 
ala gin pro arg pro ala 

CCG TGC TCA CCG ACG GCA 
pro cys ser pro thr ala 

GCG TCA CGC TGG GAT ACA 
ala ser arg trp asp thr 

CAG CCA CTA CAA GAT C 
gin pro leu gin asp 



GCA CAA CCG CGG CCG GCA ATG 
ala gin pro arg pro ala met 
61/21 

CCG TGC TCA CCG ACG GCA ACC 
pro cys ser pro thr ala thr 
121/41 

GCG TCA CGC TGG GAT ACA CGT 
ala ser arg trp asp thr arg 
181/61 

GCA GCC ACT ACA AGA TCA CAG 
ala ala thr thr arg ser gin 
241/81 

CAT TCT GGT CGC AGG TCT TTC 
his ser gly arg arg ser phe 
301/101 

GAC CAC GAC CGC GGC AGG CAC 
asp his asp arg gly arg his 
361/121 

ATC GAC GGT AAG GAC CAG AAC 
ile asp gly lys asp gin asn 
421/141 

GTC AAC ATC GCG ATC GGC GGG 
val asn ile ala ile gly gly 
481/161 

CCT CCG GAG GTG AAG TCC GTT 
pro pro glu val lys ser val 
541/181 

TCG GGC ACC GGA CAG GGT AAC 
ser gly thr gly gin gly asn 



31/11 

TCA ACA TCG CGA TCG GCG 
ser thr ser arg ser ala 
91/31 

CTC CGG AGG TGA AGT CCG 
leu arg arg OPA ser pro 
151/51 

CGG GCA CCG GAC AGG GTA 
arg ala pro asp arg val 

211/71 
GGT GAA GCG TGG ACT GAC 
gly glu ala trp thr asp 

271/91 

CGG ATG TTC AAG CAA CAA 
arg met phe lys gin gin 

331/111 
GAC GGC AAG CCC CGG CGC 
asp gly lys pro arg arg 

391/131 
GTC ACC GGC TCC GTG GTG 
val thr gly ser val val 

451/151 
GCG GCG ACC GGC ATT GCC 
ala ala thr gly ile ala 

511/171 
GGG CTC GGT AAC GTC AAC 
gly leu gly asn val asn 

571/191 
GCC TCG GCA ACC AAG GAC 
ala ser ala thr lys asp 

SEQ ID No.23A 



GGG CGG CGA CCG GCA TTG CCG 
gly arg arg pro ala leu pro 

TTG GGC TCG GTA ACG TCA ACG 
leu gly ser val thr ser thr 

ACG CCT CGG CAA CCA AGG ACG 
thr pro arg gin pro arg thr 

GGT CGC GGT AGC CGG AGC CGC 
gly arg gly ser arg ser arg 

GTC GAC TAC AGG AAG CGG TGA 
val asp tyr arg lys arg OPA 

CGC TCC GGG CCG AAG GTC GTC 
arg ser gly pro lys val val 

TGC ACA ACC GCG GCC GGC AAT 
cys thr thr ala ala gly asn 

GCC GTG CTC ACC GAC GGC AAC 
ala val leu thr asp gly asn 

GGC GTC ACG CTG GGA TAC ACG 
gly val thr leu gly tyr thr 

GGC AGC CAC TAC AAG ATC 
gly ser his tyr lys ile 



FIGURE 23A 
REPLACEMENT SHEET (RULE 26) 



♦ 

89/185 



32/11 



CAC AAC 


CGC 


GGC 


CGG 


CAA 


TGT 


CAA 


CAT 


CGC 


/"AT r r r 
bAl L-^Cj 


r r r 


r r r 

but 


bb^ 


fZAP 


rrr 


PAT 


TGC 


CGC 


his asn 


arg 


gly 


arg 


gin 


cys 


gin 


nis 


arg 


asp arg 


arg 


gly 


gly 


asp 


a rg 


his 


cys 


arg 


62/21 
































CGG 


CGT GCT 


CAC 


CGA 


CGG 


CAA 


r r r 

ecu 






VJ^J 1 




\^ \J X 


x vj\j 


GCT 


CGG 


TAA 


CGT 


CAA 


arg ala 


his 


arg 


arg 


gin 


pro 


ser 


gly 


gly 


glu val 


a r g 




ala 


arg 


OCH 


arg 


gin 


arg 


122/41 


















1 S? /Si 


















CGT CAC 


GCT 


GGG 


ATA 


CAC 


T 1 r 


c r* C 


par 


CC.CI 


apa p-pp* 


TAA 


CGC 


CTC 


GGC 


AAC 


CAA 


GGA 


CGG 


arg his 


ala 


gly 


ile 


nis 


vai 


gly 


hi s 


arg 


(-ill. y x 


OCH 


arg 


leu 


crl v 

y j_y 


asn 


gin 


alv 


ara 


182/61 
















212/71 


















CAG CCA 


CTA 


CAA 


GAT 


r 7\ r 


Abo 


Vj J. \3 


aap, 


V— O X 




ACG 


GTC 


GCG 


GTA 


GCC 


GGA 


GCC 


GCC 


gin pro 


leu 


gin 


asp 


nis 


a rg 


vax 


lys 


arg 


gly leu 


thr 


val 


ala 


val 


ala 


crl v 


ala 


ala 


242/81 
















070 /qi 

^ f *L / -7 X. 


















ATT CTG 


GTC 


GCA 


GGT 


CTT 


mr r 

TCC 




lbl 




AP.P AAP 


aap* 


TPP» 


ACT 


ACA 


GGA 


AGC 


GGT 


GAG 


ile leu 


val 


ala 


gly 


leu 


ser 


gly 


cys 


ear 

OCX, 


OCX. CI O 11 


1 V*5 


ser 


thr 


thr 


alv 


ser 


alv 


alu 

J- «-t. 


302/101 
















332/111 


















ACC ACG 


ACC 


GCG 


GCA 


GGC 


ACG 


A r r 




ACr 




err 


GCT 


CCG 


GGC 


CGA 


AGG 


TCG 


TCA 


un x. m x. 


thr 


ala 


ala 


alv 


thr 


thr 


ala 


ser 


pro gly 


ala 


ala 


pro 


gly 


arg 


arg 


ser 


ser 


362/121 
















392/131 


















TCG ACG 


GTA 


AGG 


ACC 


AGA 


ACG 


TCA 


CCG 


GCT 


CCG TGG 


TGT 


GCA 


CAA 


CCG 


CGG 


CCG 


GCA 


ATG 


ser thr 


val 


arg 


thr 


arg 


thr 


ser 


pro 


ala 


pro trp 


cys 


ala 


gin 


pro 


arg 


pro 


ala 


met 


422/141 
















452/151 


















TCA AC A 


TCG 


CGA 


TCG 


GCG 


GGG 


CGG 


CGA 


CCG 


GCA TTG 


CCG 


CCG 


TGC 


TCA 


CCG 


ACG 


GCA 


ACC 


ser thr 


ser 


arg 


ser 


ala 


gly 


arg 


arg 


pro 


ala leu 


pro 


pro 


cys 


ser 


pro 


thr 


ala 


thr 


482/161 
















512/171 


















CTC CGG 


AGG 


TGA 


AGT 


CCG 


TTG 


GGC 


TCG 


GTA 


ACG TCA 


ACG 


GCG 


TCA 


CGC 


TGG 


GAT 


ACA 


CGT 


leu arg 


arg 


OPA 


ser 


pro 


leu 


gly 


ser 


val 


thr ser 


thr 


ala 


ser 


arg 


trp 


asp 


thr 


arg 


542/181 
















572/191 


















CGG GCA 


CCG 


GAC 


AGG 


GTA 


ACG 


CCT 


CGG 


CAA 


CCA AGG 


ACG 


GCA 


GCC 


ACT 


ACA 


AGA 


TC 




arg ala 


pro 


asp 


arg 


val 


thr 


pro 


arg 


gin 


pro arg 


thr 


ala 


ala 


thr 


thr 


arg 







SEQ ID No.23B 
FIGURE 23B 



REPLACEMENT SHEET (RULE 26) 



♦ 

90/185 



33/11 



ACA ACC 


GCG 


GCC 


GGC 


AAT 


GTC 


AAC 


ATC 


GCG 


ATC GGC 


GGG 


GCG 


GCG 


ACC 


GGC 


ATT 


GCC 


GCC 


thr thr 


ala 


ala 


gly 


asn 


val 


asn 


ile 


ala 


ile gly 


gly 


ala 


ala 


thr 


gly 


ile 


ala 


ala 


63/21 


















93/31 


















GTG CTC 


ACC 


GAC 


GGC 


AAC 


CCT 


CCG 


GAG 


GTG 


AAG TCC 


GTT 


GGG 


CTC 


GGT 


AAC 


GTC 


AAC 


GGC 


val leu 


thr 


asp 


gly 


asn 


pro 


pro 


glu 


val 


lys ser 


val 


gly 


leu 


gly 


asn 


val 


asn 


gly 


123/41 


















153/51 


















GTC ACG 


CTG 


GGA 


TAC 


ACG 


TCG 


GGC 


ACC 


GGA 


CAG GGT 


AAC 


GCC 


TCG 


GCA 


ACC 


AAG 


GAC 


GGC 


val thr 


leu 


gly 


tyr 


thr 


ser 


gly 


thr 


gly 


gin gly 


asn 


ala 


ser 


ala 


thr 


lys 


asp 


gly 


183/61 
















213/71 


















AGC CAC 


TAC 


AAG 


ATC 


ACA 


GGG 


TGA 


AGC 


GTG 


GAC TGA 


CGG 


TCG 


CGG 


TAG 


CCG 


CAG 


CCG 


CCA 


ser his 


tyr 


lys 


ile 


thr 


gly 


OPA 


ser 


val 


asp OPA 


arg 


ser 


arg 


AMB 


pro 


glu 


pro 


pro 


243/81 


















273/91 


















TTC TGG 


TCG 


CAG 


GTC 


TTT 


CCG 


GAT 


GTT 


CAA 


GCA ACA 


AGT 


CGA 


CTA 


CAG 


GAA 


GCG 


GTG 


AGA 


phe trp 


ser 


gin 


val 


phe 


pro 


asp 


val 


gin 


ala thr 


ser 


arg 


leu 


gin 


glu 


ala 


val 


arg 


303/101 


















333/111 


















CCA CGA 


CCG 


CGG 


CAG 










fZPP 


cccz cccz 


p r r 


CTC 


CGG 


GCC 


GAA 


GGT 


CGT 


CAT 


pro arg 


pro 


arg 


gin 


ala 


arg 


arg 


gin 


ala 


pro ala 


pro 


leu 


arg 


ala 


glu 


gly 


arg 


his 


363/121 


















393/131 


















CGA CGG 


TAA 


GGA 


CCA 


GAA 


CGT 


CAC 


CGG 


CTC 


CGT GGT 


GTG 


CAC 


AAC 


CGC 


GGC 


CGG 


CAA 


TGT 


arg arg 


OCH 


gly 


pro 


glu 


arg 


his 


arg 


leu 


arg gly 


val 


his 


asn 


arg 


gly 


arg 


gin 


cys 


423/141 


















453/151 


















CAA CAT 


CGC 


GAT 


CGG 


CGG 


GGC 


GGC 


GAC 


CGG 


CAT TGC 


CGC 


CGT 


GCT 


CAC 


CGA 


CGG 


CAA 


CCC 


gin his 


arg 


asp 


arg 


arg 


gly 


gly 


asp 


arg 


his cys 


arg 


arg 


ala 


his 


arg 


arg 


gin 


pro 


483/161 


















513/171 


















TCC GGA 


GGT 


GAA 


GTC 


CGT 


TGG 


GCT 


CGG 


TAA 


CGT CAA 


CGG 


CGT 


CAC 


GCT 


GGG 


ATA 


CAC 


GTC 


ser gly 


gly 


glu 


val 


arg 


trp 


ala 


arg 


OCH 


arg gin 


arg 


arg 


his 


ala 


gly 


ile 


his 


val 


543/181 


















573/191 


















GGG CAC 


CGG 


ACA 


GGG 


TAA 


CGC 


CTC 


GGC 


AAC 


CAA GGA 


CGG 


CAG 


CCA 


CTA 


CAA 


GAT 


C 




gly his 


arg 


thr 


gly 


OCH 


arg 


leu 


gly 


asn 


gin gly 


arg 


gin 


pro 


leu 


gin 


asp 







SEQ ID No.23C 
FIGURE 23C 



REPLACEMENT SHEET (RULE 26) 



♦ 

91/185 



31/11 



CTA ACG 


ACA 


GGC 


AAA 


GGA 


GCA 


CAG 


GGT 


GAA 


GCG TGG 


ACT 


GAC 


GGT 


CGC 


GGT 


AGC 


CGG 


AGC 


leu thr 


thr 


gly 


lys 


gly 


ala 


gin 


gly 


glu 


ala trp 


thr 


asp 


gly 


arg 


gly 


ser 


arg 


ser 


61/21 
















91/31 


















CGC CAT 


TCT 


GGT 


CGC 


AGG 


TCT 


TTC 


CGG 


ATG 


TTC AAG 


CAA 


CAA 


GTC 


GAC 


TAC 


AGG 


AAG 


CGG 


arg his 


ser 


gly 


arg 


arg 


ser 


phe 


arg 


met 


phe lys 


gin 


gin 


val 


asp 


tyr 


arg 


lys 


arg 


121/41 


















151/51 


















TGA GAC 


CAC 


GAC 


CGC 


GGC 


AGG 


CAC 


GAC 


GGC 


AAG CCC 


CGG 


CGC 


CGC 


TCC 


GGG 


CCG 


AAG 


GTC 


OPA asp 


his 


asp 


arg 


gly 


arg 


his 


asp 


gly 


lys pro 


arg 


arg 


arg 


ser 


gly 


pro 


lys 


val 


181/61 


















211/71 


















GTC ATC 


GAC 


GGT 


AAG 


GAC 


CAG 


AAC 


GTC 


ACC 


GGC TCC 


GTG 


GTG 


TGC 


ACA 


ACC 


GCG 


GCC 


GGC 


val ile 


asp 


gly 


lys 


asp 


gin 


asn 


val 


thr 


gly ser 


val 


val 


cys 


thr 


thr 


ala 


ala 


gly 


241/81 














271/91 


















AAT GTC 


AAC 


ATC 


GCG 


ATC 


GGC 


GGG 


GCG 


GCG 


ACC GGC 


ATT 


GCC 


GCC 


GTG 


CTC 


ACC 


GAC 


GGC 


asn val 


asn 


ile 


ala 


ile 


gly 


gly 


ala 


ala 


thr gly 


ile 


ala 


ala 


val 


leu 


thr 


asp 


gly 


301/101 


















331/111 


















AAC CCT 


CCG 


GAG 


GTG 


AAG 


TCC 


GTT 


GGG 


CTC 


GGT AAC 


GTC 


AAC 


GGC 


GTC 


ACG 


CTG 


GGA 


TAC 


asn pro 


pro 


glu 


val 


lys 


ser 


val 


gly 


leu 


gly asn 


val 


asn 


gly 


val 


thr 


leu 


gly 


tyr 


361/121 














391/131 


















ACG TCG 


GGC 


ACC 


GGA 


CAG 


GGT 


AAC 


GCC 


TCG 


GCA ACC 


AAG 


GAC 


GGC 


AGC 


CAC 


TAC 


AAG 


ATC 


thr ser 


gly 


thr 


gly 


gin 


gly 


asn 


ala 


ser 


ala thr 


lys 


asp 


gly 


ser 


his 


tyr 


lys 


ile 



SEQ ID No.24A 
FIGURE 24A 





















32/11 


















TAA CGA 


CAG 


GCA 


AAG 


GAG 


CAC 


AGG 


GTG 


AAG 


CGT GGA 


CTG 


ACG 


GTC 


GCG 


GTA 


GCC 


GGA 


GCC 


OCH arg 


gin 


ala 


lys 


glu 


his 


arg 


val 


lys 


arg gly 


leu 


thr 


val 


ala 


val 


ala 


gly 


ala 


62/21 
















92/31 


















GCC. ATT 


CTG 


GTC 


GCA 


GGT 


CTT 


TCC 


GGA 


TGT 


TCA AGC 


AAC 


AAG 


TCG 


ACT 


ACA 


GGA 


AGC 


GGT 


ala ile 


leu 


val 


ala 


gly 


leu 


ser 


gly 


cys 


ser ser 


asn 


lys 


ser 


thr 


thr 


qly 


ser 


gly 


122/41 


















152/51 


















GAG ACC 


ACG 


ACC 


GCG 


GCA 


GGC 


ACG 


ACG 


GCA 


AGC CCC 


GGC 


GCC 


GCT 


CCG 


GGC 


CGA 


AGG 


TCG 


glu thr 


thr 


thr 


ala 


ala 


gly 


thr 


thr 


ala 


ser pro 


gly 


ala 


ala 


pro 


gly 


arg 


arg 


ser 


182/61 


















212/71 


















TCA TCG 


ACG 


GTA 


AGG 


ACC 


AGA 


ACG 


TCA 


CCG 


GCT CCG 


TGG 


TGT 


GCA 


CAA 


CCG 


CGG 


CCG 


GCA 


ser ser 


thr 


val 


arg 


thr 


arg 


thr 


ser 


pro 


ala pro 


trp 


cys 


ala 


gin 


pro 


arg 


pro 


ala 


242/81 


















272/91 


















ATG TCA 


ACA 


TCG 


CGA 


TCG 


GCG 


GGG 


CGG 


CGA 


CCG GCA 


TTG 


CCG 


CCG 


TGC 


TCA 


CCG 


ACG 


GCA 


met ser 


thr 


ser 


arg 


ser 


ala 


gly 


arg 


arg 


pro ala 


leu 


pro 


pro 


cys 


ser 


pro 


thr 


ala 


302/101 


















332/111 














GAT 


ACA 


ACC CTC 


CGG 


AGG 


TGA 


AGT 


CCG 


TTG 


GGC 


TCG 


GTA ACG 


TCA 


ACG 


GCG 


TCA 


CGC 


TGG 


thr leu 


arg 


arg 


OPA 


ser 


pro 


leu 


gly 


ser 


val thr 


ser 


thr 


ala 


ser 


arg 


trp 


asp 


thr 


362/121 














392/131 














AGA 


TC 


CGT CGG 


GCA 


CCG 


GAC 


AGG 


GTA 


ACG 


CCT 


CGG 


CAA CCA 


AGG 


ACG 


GCA 


GCC 


ACT 


ACA 


arg arg 


ala 


pro 


asp 


arg 


val 


thr 


pro 


arg 


gin pro 


arg 


thr 


ala 


ala 


thr 


thr 


arg 





SEQ ID No.24B 
FIGURE 24B 



REPLACEMENT SHEET (RULE 26) 



\ 

« 



92/185 



33/11 



AAC LjAC 


Abb 




7\ rzri 




ACA 


GGG 


TGA 


AGC GTG GAC 


TGA 


CGG 


TCG 


CGG 


TAG 


CCG 


GAG 


CCG 


asn asp 


arg 


gin 


arg 


ser 


thr 


alv 


OPA 


ser va 1 asp 


OPA 


arq 


ser 


arg 


AMB 


pro 


glu 


pro 












93/31 


















CCA lit 


1 UU 


1 


PAH 


GTC 


TTT 


CCG 


GAT 


GTT CAA GCA 


ACA 


AGT 


CGA 


CTA 


CAG 


GAA 


GCG 


GTG 


pro pne 


trp 




gin 


val 


phe 


p ro 


asD 


val gin ala 


thr 


ser 


arg 


leu 


gin 


glu 


ala 


val 
















153/51 


















AGA CCA 


tun 






PAG 


GCA 


CGA 


CGG 


CAA GCC CCG 


GCG 


CCG 


CTC 


CGG 


GCC 


GAA 


GGT 


CGT 


arq pro 


arg 


pro 


arg 


gin 


ala 


arg 


arg 


gin ala pro 


ala 


pro 


leu 


arg 


ala 


glu 


gly 


arg 


183/61 












213/71 


















CAT CGA 


CGG 


TAA 


GGA 


CCA 


GAA 


CGT 


CAC 


CGG CTC CGT 


GGT 


GTG 


CAC 


AAC 


CGC 


GGC 


CGG 


CAA 


his arg 


arg 


OCH 


gly 


pro 


glu 


arg 


his 


arg leu arg 


gly 


val 


his 


asn 


arg 


gly 


arg 


gin 


243/81 














273/91 


















TGT CAA 


CAT 


CGC 


GAT 


CGG 


CGG 


GGC 


GGC 


GAC CGG CAT 


TGC 


CGC 


CGT 


GCT 


CAC 


CGA 


CGG 


CAA 


cys gin 


his 


arg 


asp 


arg 


arg 


gly 


qly 


asp arg his 


cys 


arg 


arg 


ala 


his 


arg 


arg 


gin 


303/101 
















333/111 


















CCC TCC 


GGA 


GGT 


GAA 


GTC 


CGT 


TGG 


GCT 


CGG TAA CGT 


CAA 


CGG 


CGT 


CAC 


GCT 


GGG 


ATA 


CAC 


pro ser 


gly 


gly 


glu 


val 


arg 


trp 


ala 


arg OCH arg 


gin 


arg 


arg 


his 


ala 


gly 


ile 


his 


363/121 














393/131 


















GTC GGG 


CAC 


CGG 


ACA 


GGG 


TAA 


CGC 


CTC 


GGC AAC CAA 


GGA 


CGG 


CAG 


CCA 


CTA 


CAA 


GAT 


C 


val gly 


his 


arg 


thr 


gly 


OCH 


arg 


leu 


gly asn gin 


gly 


arg 


gin 


pro 


leu 


gin 


-asp 





SEQ ID No.24C 
FIGURE 24C 



Direct primer 
5 1 ACG CGG CGC AGC CTG TTG 3' 



SEQ ID No. 25 
FIGURE 25 



Reverse primer 
5 1 CGA CCT TGG GAT TCG CCT 3 ' 



SEQ ID No ,26 
FIGURE 26 



REPLACEMENT SHEET (RULE 26) 



93/185 



31/11 



CCT ACC 


AGC 


AAG 


AGC 


CCA 


GGG 


CTT 


CAC 


AGG 


ACC TAA 


AAG 


GAG 


TAG 


CGC 


CCA 


TGG 


GCT 


TGA 


pro thr 


ser 


lys 


ser 


pro 


gly 


leu 


his 


arg 


thr OCH 


lys 


glu 


AMB 


arg 


pro 


trp 


ala 


OPA 


61/21 


















91/31 


















TCC AAT 


TTT 


CCT 


TCC 


GCC 


CCG 


TGC 


AAT 


ACC 


ATC TGC 


AAG 


ACC 


AGC 


GAC 


GGC 


CCG 


TGG 


TTG 


ser asn 


phe 


pro 


ser 


ala 


pro 


cys 


asn 


thr 


ile cys 


lys 


thr 


ser 


asp 


gly 


pro 


trp 


leu 


121/41 
















151/51 


















CGG TCG 


CGC 


AGC 


TTG 


CGG 


AAA 


CGG 


GGT 


ATG 


GAC CCT 


GCC 


GTA 


CCG 


TTG 


TTG 


CCA 


CTT 


GAT 


arg ser 


arg 


ser 


leu 


arg 


lys 


arg 


gly 


met 


asp pro 


ala 


val 


pro 


leu 


leu 


pro 


leu 


asp 


181/61 
















211/71 


















GTC GTC 


GCT 


CTC 


CAC 


CCG 


TCG 


GGG 


GGC 


GAA 


AGC CAT 


TCC 


GAC 


ACT 


GGG 


ATC 


CTC 


AAA 


ACG 


val val 


ala 


leu 


his 


pro 


ser 


gly 


gly 


glu 


ser his 


ser 


asp 


thr 


gly 


ile 


leu 


lys 


thr 


241/81 


















271/91 


















TCG GCT 


GAG 


TGT 


CTG 


CAG 


GGC 


TCC 


GGG 


GAG 


CAG CCG 


ATC 


ATC 


ACC 


ATG 


TAC 


GAA 


CTG 


AAT 


ser ala 


glu 


cys 


leu 


gin 


gly 


ser 


gly 


glu 


gin pro 


ile 


ile 


thr 


met 


tyr 


glu 


leu 


asn 


301/101 














331/111 


















AAG TCC 


CCC 


CCG 


CGC 


GAC 


TTC 


CAG 


ACA 


TTT 


GTT GTG 


GTT 


TCG 


GTT 


GAG 


GCC 


GAG 


GCG 


AGG 


lys ser 


pro 


pro 


arg 


asp 


phe 


gin 


thr 


phe 


val val 


val 


ser 


val 


glu 


ala 


glu 


ala 


arg 


361/121 
















391/131 


















CTC ATT 


TCG 


CAG 


CAA 


GCG 


GTC 


TCC 


GGG 


TCG 


CAG CAT 


CGT 


TGC 


GGC 


GAT 


CGC 


GGC 


GCA 


GTC 


leu ile 


ser 


gin 


gin 


ala 


val 


ser 


gly 


ser 


gin his 


arg 


cys 


gly 


asp 


arg 


gly 


ala 


val 


421/141 




































GTC GGA 


CGA 


GTC 


GTC 


GTC 


AAC 


GAC 


CAC 


GAT 


C 


















val gly 


arg 


val 


val 


val 


asn 


asp 


his 


asp 





















SEQ ID No.27A 
FIGURE 27A 



CTA CCA GCA AGA GCC CAG GGC TTC ACA GGA 
leu pro ala arg ala gin gly phe thr gly 
61/21 

CCA ATT TTC CTT CCG CCC CGT GCA ATA CCA 
pro ile phe leu pro pro arg ala ile pro 
121/41 

GGT CGC GCA GCT TGC GGA AAC GGG GTA TGG 
gly arg ala ala cys gly asn gly val trp 
181/61 

TCG TCG CTC TCC ACC CGT CGG GGG GCG AAA 
ser ser leu ser thr arg arg gly ala lys 
241/81 

CGG CTG AGT GTC TGC AGG GCT CCG GGG AGC 
arg leu ser val cys arg ala pro gly ser 
301/101 

AGT CCC CCC CGC GCG ACT TCC AGA CAT TTG 
ser pro pro arg ala thr ser arg his leu 
361/121 

TCA TTT CGC AGC AAG CGG TCT CCG GGT CGC 
ser phe arg ser lys arg ser pro gly arg 
421/141 

TCG GAC GAG TCG TCG TCA ACG ACC ACG ATC 
ser asp glu ser ser ser thr thr thr ile 



31/11 

CCT AAA AGG AGT AGC GCC CAT GGG CTT GAT 
pro lys arg ser ser ala his gly leu asp 
91/31 

TCT GCA AGA CCA GCG ACG GCC CGT GGT TGC 
ser ala arg pro ala thr ala arg gly cys 
151/51 

ACC CTG CCG TAC CGT TGT TGC CAC TTG ATG 
thr leu pro tyr arg cys cys his leu met 
211/71 

GCC ATT CCG ACA CTG GGA TCC TCA AAA CGT 
ala ile pro thr leu gly ser ser lys arg 
271/91 

AGC CGA TCA TCA CCA TGT ACG AAC TGA ATA 
ser arg ser ser pro cys thr asn OPA ile 
331/111 

TTG TGG TTT CGG TTG AGG CCG AGG CGA GGC 
leu trp phe arg leu arg pro arg arg gly 
391/131 

AGC ATC GTT GCG GCG ATC GCG GCG CAG TCG 
ser ile val ala ala ile ala ala gin ser 



SEQ ID No.27B 



FIGURE 27B 
REPLACEMENT SHEET (RULE 26) 



94/185 

33/11 

TAC CAG CAA GAG. CCC AGG GCT TCA CAG GAC CTA AAA GGA GTA GCG CCC ATG GGC TTG ATC 
tyr gin gin glu pro arg ala ser gin asp leu lys gly val ala pro met gly leu ile 
63/21 93/31 

CAA TTT TCC TTC CGC CCC GTG CAA TAC CAT CTG CAA GAC CAG CGA CGG CCC GTG GTT GCG 
gin phe ser phe arg pro val gin tyr his leu gin asp gin arg arg pro val val ala 
123/41 153/51 . 

GTC GCG CAG CTT GCG GAA ACG GGG TAT GGA CCC TGC CGT ACC GTT GTT GCC ACT TGA TGT 
val ala gin leu ala glu thr gly tyr gly pro cys arg thr val val ala thr OP A cys 
183/61 213/71 

CGT CGC TCT CCA CCC GTC GGG GGG CGA AAG CCA TTC CGA CAC TGG GAT CCT CAA AAC GTC 
arg arg ser pro pro val gly gly arg lys pro phe arg his trp asp pro gin asn val 
243/81 273/91 

GGC TGA GTG TCT GCA GGG CTC CGG GGA GCA GCC GAT CAT CAC CAT GTA CGA ACT GAA TAA 
gly OPA val ser ala gly leu arg gly ala ala asp his his his val arg thr glu OCH 
303/101 333/111 

GTC CCC CCC GCG CGA CTT CCA GAC ATT TGT TGT GGT TTC GGT TGA GGC CGA GGC GAG GCT 
val pro pro ala arg leu pro asp ile cys cys gly phe gly OPA gly arg gly glu ala 
363/121 393/131 

CAT TTC GCA GCA AGC GGT CTC CGG GTC GCA GCA TCG TTG CGG CGA TCG CGG CGC AGT CGT 
his phe ala ala ser gly leu arg val ala ala ser leu arg arg ser arg arg ser arg 
423/141 

CGG ACG AGT CGT CGT CAA CGA CCA CGA TC 
arg thr ser arg arg gin arg pro arg 

SEQ ID No.27C 
FIGURE 27C 



MKT GTATT RRRL LAVL I AJLAL P GAAVAL LAE P SAT GAS DPCAAS EVAR 
TVGSVAKSMGDYLDSHPETNQVTOTAVLQQQVGPGSVASLKAHFEANPK 
VAS D LHAL S Q P LT D L S T RC S L P I S G LQAI GLMQAVQGARR 



SEQ ID No. 28 
FIGURE 28 



GTGGGCAAGC AGCTAGCCGC GCTCGCCGCG CTGGTCGGTG CGTGCATGCT CGCAGCCGGA 60 

TGCACCAACG TGGTCGACGG GACCGCCGTG GCTGCCGACA AATCCGGACC ACT G CAT CAG 12 0 

GAT CCGAT AC CGGTTTCAGC GCTTGAAGGG CTGCTTCTCG ACTTGAGCCA GATCAATGCC 18 0 

GCGCTGGGTG C GAC AT C GAT GAAGGTGTGG TTCAACGCCA AGGCAATGTG GGACTGGAGC 24 0 

AAG AG C GT G G CCGACAAGAA TTGCCTGGCT ATC GAC GGT C CAGCACAGGA AAAG GTC TAT 300 

GCCGGCACCG GGTGGACCGC TATGCGCGGC CAACGGCTGG AT GACAGCAT C GAT G ACT C C 360 

AAGAAACGCG ACCACTACGC CAT T CAAGCG GTCGTCGGCT TCCCGACCGC AC AT GAT GCC 420 

GAGGAGTTCT ACAGCTCCTC GGTGCAAAGC TGGAGCAGCT GCTCGAACCG CCGGTTTGTC 48 0 

GAAGT CAC C C CCGGACAGGA CGACGCCGCC TGGACTGTGG CTGACGTTGT CAACGACAAC 54 0 

GGCATGCTCA GTAGCTCGCA GGTTCAGGAA GGCGGCGACG GATGGACCTG CCAGCGTGCC 600 

CTGACTGCGC GCAACAACGT C ACT AT C GAC ATTGTCACGT GCGCCTATAG CCAACCGGAT 660 

TTGGTGGCGA TTGGCATCGC TAACCAAATC GCGGCCAAGG TTGCTAAGCA GTAG 714 



SEQ ID No. 29 

FIGURE 29 
REPLACEMENT SHEET (RULE 26) 



95/185 



MGKQLAALAAXVGACMIJXAGCTNVV^ 

MKVWFNAKAMWDWSKSVADKNCI^IDGPAQEKVYAGTGFTWTAMRGQRLDDSIDDSKKRDHYAI QAW 
GFPTAHDAEEFYSSSVQSWSSCSNRREVEVTFTPGQDDAAWTVADVVNDNGMLSSSQVQEGGDGWTCQ 
RALTARNNVT I D I VTCAYSQ PDLVFTAI GIANQI AAKVAKQ 



SEQ ID No ,30 
FIGURE 30 



1/1 31/11 



AGG CGA 


ATA 


CCC 


GCG 


AGG 


GCA 


r-> 


CGA 


CGG 


CGG CCC 


TGC 


C(Z(Z 
Lob 






ClCrC 


A. VjV_* 






arg arg 


ile 


pro 


ala 


arg 


ala 


ala 


arg 


arg 


arg pro 


cys 


arg 


arg 


a r g 


gly 


cys 


OPA 


1-h r 


61/21 


















91/31 


















ACA CAT 


CCC 


AGC 


CGC 


bLA 


r~* r— r~< 


1 1 L- 


CGG 


TAT 


GCG GCA 


GGA 


TAA 




ACT 


CCA 


ACA 


GCA 


CGA 


thr his 


pro 


ser 


arg 


a J. a 


arg 


phe 


arg 


tyr 


ala ala 


gly 




th r 


thr 


pro 


thr 


ala 


arg 


121/41 
















151/51 


















ar*zi cent. 




1 1*J 


CGA 


CAA 


CCA 


AAG 


CCC 


TCG 


CGC CTG 


GCT 


CGA 


TTT 


CGC 


GCG 


CAA 


CGC 


GGC 


thr pro 


gly 


leu 


arg 


gin 


pro 


lys 


pro 


ser 


arg leu 


ala 


arg 


phe 


arg 


ala 


gin 


arg 


gly 


181/61 


















211/71 


















GTT CTG 


CCG 


CCT 


CGA 


TCT 


CAG 


CGC 


GGA 


GGG 


CGT CGA 


GAT 


C 














val leu 


pro 


pro 


arg 


ser 


gin 


arg 


gly 


gly 


arg arg 


asp 






























SEQ ID 


No.31A 






























FIGURE 31A 


















1/1 


















31/11 


















GGC GAA 


TAG 


CCG 


CGA 


GGG 


CAG 


CGC 


GAC 


GGC 


GGC CCT 


GCC 


GGC 


GCC 


GTG 


GCT 


GCT 


GAA 


CAA 


gly glu 


tyr 


pro 


arg 


gly 


gin 


arg 


asp 


gly 


gly pro 


ala 


gly 


ala 


val 


ala 


ala 


glu 


gin 


61/21 


















91/31 


















CAC ATC 


CCA 


GCC 


GCG 


CAC 


GCT 


TCC 


GGT 


ATG 


CGG CAG 


GAT 


AAA 


CGA 


CCC 


CAA 


CAG 


CAC 


GAA 


his ile 


pro 


ala 


ala 


his 


ala 


ser 


gly 


met 


arg gin 


asp 


lys 


arg 


pro 


gin 


gin 


his 


glu 


121/41 
















151/51 


















CAC CAG 


GAT 


TGC 


GAC 


AAC 


CAA 


AGC 


CCT 


CGC 


GCC TGG 


CTC 


GAT 


TTC 


GCG 


CGC 


AAC 


GCG 


GCG 


his gin 


asp 


cys 


asp 


asn 


gin 


ser 


pro 


arg 


ala trp 


leu 


asp 


phe 


ala 


arg 


asn 


ala 


ala 


181/61 


















211/71 


















TTC TGC 


CGC 


CTC 


GAT 


CTC 


AGC 


GCG 


GAG 


GGC 


GTC GAG 


ATC 
















phe cys 


arg 


leu 


asp 


leu 


ser 


ala 


glu 


gly 


val glu 


ile 

















SEQ ID No.31B 
FIGURE 31B 



REPLACEMENT SHEET (RULE 26) 



96/185 



l/i 

GCG AAT ACC CGC GAG 
ala asn thr arg glu 
61/21 

ACA TCC CAG CCG CGC 
thr ser gin pro arg 
121/41 

ACC AGG ATT GCG ACA 
thr arg ile ala thr 
181/61 

TCT GCC GCC TCG ATC 
ser ala ala ser ile 



GGC AGC GCG ACG GCG 
gly ser ala thr ala 

ACG CTT CCG GTA TGC 
thr leu pro val cys 

ACC AAA GCC CTC GCG 
thr lys ala leu ala 

TCA GCG CGG AGG GCG 
ser ala arg arg ala 



31/11 

GCC CTG CCG GCG CCG 
ala leu pro ala pro 
91/31 

GGC AGG ATA AAC GAC 
gly arg ile asn asp 
151/51 

CCT GGC TCG ATT TCG 
pro gly ser ile ser 
211/71 
TCG AGA TC 
ser arg 



TGG CTG CTG AAC AAC 
trp leu leu asn asn 

CCC AAC AGC ACG AAC 
pro asn ser thr asn 

CGC GCA ACG CGG CGT 
arg ala thr arg arg 



SEQ ID No.31C 



FIGURE 31C 



ORF according to Cole et al . (Nature 393:537-544) and containing seq31A 
1/1 31/11 

taa acg acc cca aca gca cga aca cca gga ttg cga caa cca aag ccc teg cgc ctg get 
OCH thr thr pro thr ala arg thr pro gly leu arg gin pro lys pro ser arg leu ala 
61/21 91/31 

cga ttt cgc gcg caa cgc ggc gtt ctg ccg cct cga tct cag cgc gga ggg cgt cga gat - 
arg phe arg ala gin arg gly val leu pro pro arg ser gin arg gly gly arg arg asp 
121/41 151/51 

ccc egg cgt cgt gtt cgt ggc tea tea tct gca tec tec ggg ctt ggc cgc get gac egg 
pro arg arg arg val arg gly ser ser ser ala ser ser gly leu gly arg ala asp arg 
181/61 211/71 

cag ccc gac ccc agg cat gee cag gee gac ggc gcg ccc egg ctg ccc ggc ggt gtg cgc 
gin pro asp pro arg his ala gin ala asp gly ala pro arg leu pro gly gly val arg 
241/81 271/91 

gtc gec ggc gcg ggt gcg gcg gtg ggt cag gac gee ggc gtc ggc gat gag gtg gtg egg 
val ala gly ala gly ala ala val gly gin asp ala gly val gly asp glu val val arg 
301/101 331/111 

cgc cgc ttc ggt gac ctt cgt ggt gat gac gtc gec ggg acg cac gcg egg ctg gee ggc 
arg arg phe gly asp leu arg gly asp asp val ala gly thr his ala arg leu ala gly 
361/121 391/131 

ggt gaa gtg cac cag gcg ccc gtc gcg cgc ccg ccc get cat gcg cgc cgt gac ggt gtc 
gly glu val his gin ala pro val ala arg pro pro ala his ala arg arg asp gly val 
421/141 451/151 

ctt gcg ccc ttc ccc ggt ggc cac cag cac etc gac ggc ctg ccc gac cag ggc gcg gtt 
leu ala pro phe pro gly gly his gin his leu asp gly leu pro asp gin gly ala val 
481/161 511/171 

ggc ttc cag cga gat ttg etc ctg cag cgc gat cag gcg ttc ata gcg ttc ctg cac aac 
gly phe gin arg asp leu leu leu gin arg asp gin ala phe ile ala phe leu his asn 
541/181 571/191 

ggc ttt egg cag ctg tec gtc gag ttg cgc ggc egg tgt ccc ggg ccg ctt gga gta ttg 
gly phe arg gin leu ser val glu leu arg gly arg cys pro gly pro leu gly val leu 
601/201 631/211 

gaa ggt aaa tgc ggc cgc gaa gcg ggc ccg gcg cac cac gtc gag cgt ggc cgc gaa gtc 
glu gly lys cys gly arg glu ala gly pro ala his his val glu arg gly arg glu val 
661/221 691/231 

etc ttc ggt etc ccc ggg gaa acc gac gat cag ate ggt ggt aat cgc ggc atg egg gat 
leu phe gly leu pro gly glu thr asp asp gin ile gly gly asn arg gly met arg asp 
721/241 751/251 

ggc cgc ccg cac gcg etc gat gat gee gag gta gcg etc ggc acg ata gga ccg ccg cat 
gly arg pro his ala leu asp asp ala glu val ala leu gly thr ile gly pro pro his 
781/261 811/271 
cgc gcg cag gat ccg gtc gga tec gga ctg tag 
arg ala gin asp pro val gly ser gly leu AMB 



SEQ ID No.31F 



FIGURE 31F 



REPLACEMENT SHEET (RULE 26) 



# 



97/185 



1/1 

aga ctg gtg tac acg gag acc 
arg leu val tyr thr glu thr 
61/21 

eta gtg aag gtc att cag aaa 
leu val lys val ile gin lys 
121/41 

ttc gtc ggt ttc gcg egg atg 
phe val gly phe ala arg met 
181/61 

ccg ttg egg gac tac gaa ctg 
pro leu arg asp tyr glu leu 
241/81 

ggg ccg acc gcg ctg aac tat 
gly pro thr ala leu asn tyr 
301/101 

tac ggg cgc ate aaa cgc cag 
tyr gly arg ile lys arg gin 









31/11 




aag 


ctg 


aac 


teg gca 


ttc 


lys 


leu 


asn 


ser ala 


phe 








91/31 




ctg 


teg 


ggc 


ttg age 


ate 


leu 


ser 


gly 


leu ser 


ile 








151/51 




gtc 


gag 


gec 


etc ggc 


ggc 


val 


glu 


ala 


leu gly 


gly 








211/71 




ggc 


acg 


gtg 


ctg gag 


cac 


gly 


thr 


val 


leu glu 


his 








271/91 




gtg 


cgc 


get 


cgc cag 


gtc 


val 


arg 


ala 


arg gin 


val 








331/111 




cag 


ttg 


ttt 


ttg teg 


teg 


gin 


leu 


phe 


leu ser 


ser 



SEQ ID No.32A 
FIGURE 32A 



tec ttc ggc ggg cct aag tgt 
ser phe gly gly pro lys cys 

aac egg ttc ate gcg att gac 
asn arg phe ile ala ile asp 

gtc gag gta tgc age acc acc 
val glu val cys ser thr thr 

gee gga cgc cag gtc att gac 
ala gly arg gin val ile asp 

acc acc gag age aat ggc gac 
thr thr glu ser asn gly asp 

ctg ctg cgt teg atg ate 
leu leu arg ser met ile 



1/1 

gac tgg tgt aca egg aga cca age tga act 
asp trp cys thr arg arg pro ser OPA thr 
61/21 

tag tga agg tea ttc aga aac tgt egg get 
AMB OPA arg ser phe arg asn cys arg ala 
121/41 

teg teg gtt teg cgc gga tgg teg agg ccc 
ser ser val ser arg gly trp ser arg pro 
181/61 

cgt tgc ggg act acg aac tgg gca egg tgc 
arg cys gly thr thr asn trp ala arg cys 
241/81 

ggc cga ccg cgc tga act atg tgc gcg etc 
gly arg pro arg OPA thr met cys ala leu 
301/101 

acg ggc gca tea aac gee age agt tgt ttt 
thr gly ala ser asn ala ser ser cys phe 



31/11 


















egg cat 


tct 


cct 


teg 


gcg 


ggc 


eta 


agt 


gtc 


arg his 


ser 


pro 


ser 


ala 


gly 


leu 


ser 


val 


91/31 


















tga gca 


tea 


acc 


ggt 


tea 


teg 


cga 


ttg 


act 


OPA ala 


ser 


thr 


gly 


ser 


ser 


arg 


leu 


thr 


151/51 


















teg gcg 


gcg 


teg 


agg 


tat 


gca 


gca 


cca 


ccc 


ser ala 


ala 


ser 


arg 


tyr 


ala 


ala 


pro 


pro 


211/71 


















tgg age 


acg 


ccg 


gac 


gee 


agg 


tea 


ttg 


acg 


trp ser 


thr 


pro 


asp 


ala 


arg 


ser 


leu 


thr 


271/91 


















gee agg 


tea 


cca 


ccg 


aga 


gca 


atg 


gcg 


act 


ala arg 


ser 


pro 


pro 


arg 


ala 


met 


ala 


thr 


331/111 


















tgt cgt 


cgc 


tgc 


tgc 


gtt 


cga 


tga 


tc 




cys arg 


arg 


cys 


cys 


val 


arg 


OPA 







SEQ ID No.32B 
FIGURE 32B 



REPLACEMENT SHEET (RULE 26) 




98/185 



1/1 


















31/11 


















act ggt 


gta 


cac 


gga 


gac 


caa 


get 


gaa 


etc 


ggc att 


etc 


ctt 


egg 


egg 


gee 


taa 


gtg 


tct 


thr gly 


val 


his 


gly 


asp 


gin 


ala 


glu 


leu 


gly ile 


leu 


leu 


arg 


arg 


ala 


OCH 


val 


ser 


61/21 


















91/31 


















agt gaa 


ggt 


cat 


tea 


gaa 


act 


gtc 


ggg 


ctt 


gag cat 


caa 


ccg 


gtt 


cat 


cgc 


gat 


tga 


ctt 


ser glu 


gly 


his 


ser 


glu 


thr 


val 


gly 


leu 


glu his 


gin 


pro 


val 


his 


arg 


asp 


OPA 


leu 


121/41 
















151/51 


















cgt egg 


ttt 


cgc 


gcg 


gat 


ggt 


cga 


ggc 


cct 


egg egg 


cgt 


cga 


ggt 


atg 


cag 


cac 


cac 


cee 


arg arg 


phe 


arg 


ala 


asp 


gly 


arg 


gly 


pro 


arg arg 


arg 


arg 


gly 


met 


gin 


his 


his 


pro 


181/61 
















211/71 


















gtt gcg 


gga 


eta 


cga 


act 


ggg 


cac 


ggt 


get 


gga gca 


cgc 


egg 


acg 


cca 


ggt 


cat 


tga 


egg 


val ala 


gly 


leu 


arg 


thr 


gly 


his 


gly 


ala 


gly ala 


arg 


arg 


thr 


pro 


gly 


his 


OPA 


arg 


241/81 
















271/91 
















eta 


gec gac 


cgc 


get 


gaa 


eta 


tgt 


gcg 


cgc 


teg 


cca ggt 


cac 


cac 


cga 


gag 


caa 


tgg 


cga 


ala asp 


arg 


ala 


glu 


leu 


cys 


ala 


arg 


ser 


pro gly 


his 


his 


arg 


glu 


gin 


trp 


arg 


leu 


301/101 














331/111 


















egg gcg 


cat 


caa 


acg 


cca 


gca 


gtt 


gtt 


ttt 


gtc gtc 


get 


get 


gcg 


ttc 


gat 


gat 


c 




arg ala 


his 


gin 


thr 


pro 


ala 


val 


val 


phe 


val val 


ala 


ala 


ala 


phe 


asp 


asp 







SEQ ID No.32C 
FIGURE 32C 



sequence Rv0822c predicted by Cole et al . (Nature 393:537-544) and containing seq 32A 
1/1 31/11 

atg agt gac ggc gag age gee gcg ccg tgg gca egg etc tec gag tea gca ttc ccc gat 
Met ser asp gly glu ser ala ala pro trp ala arg leu ser glu ser ala phe pro asp 
61/21 91/31 

ggt gtt gac cga tgg ate acg gta ccg ccc gee aca tgg gtg gca gee cag ggt ccg egg 
gly val asp arg trp ile thr val pro pro ala thr trp val ala ala gin gly pro arg 
121/41 151/51 

gac ace cag aat gtc ggc tgt cat gee ace ggc gee gtt agt gtg gee gat ctg ate gee 
asp thr gin asn val gly cys his ala thr gly ala val ser val ala asp leu ile ala 
181/61 211/71 

agg etc ggc ccc get ttt cct gac etc ccc acg cac cgc cat gtc gee ccc gaa ccc gag 
arg leu gly pro ala phe pro asp leu pro thr his arg his val ala pro glu pro glu 
241/81 271/91 

cca tec ggc cgc ggc ccg aag gtc cac gac gac gee gac gac cag cag gac ace gag get 
pro ser gly arg gly pro lys val his asp asp ala asp asp gin gin asp thr glu ala 
301/101 331/111 

ate gee ate ccg gec cac teg etc gag ttc etc teg gag ctt ccc gac etc egg gca gee 
ile ala ile pro ala his ser leu glu phe leu ser glu leu pro asp leu arg ala ala 
361/121 391/131 

aac tat ccg cgc gee gac cac gee cgc cgt gaa ccc gag eta ccc ggc aag cag eta ace 
asn tyr pro arg ala asp his ala arg arg glu pro glu leu pro gly lys gin leu thr 
421/141 451/151 

gga teg get cga gtg egg cca ttg egg ate cgc cga acg teg ccc gcg ccc gee aag cca 
gly ser ala arg val arg pro leu arg ile arg arg thr ser pro ala pro ala lys pro 
481/161 511/171 

gcg ccg aac tec ggc egg cgc ccg atg gtg ctg gee gcg cgc teg ctg gcg get ctg ttt 
ala pro asn ser gly arg arg pro met val leu ala ala arg ser leu ala ala leu phe 
541/181 571/191 

gee get ctg gcg ttg gcg ctg acc ggc ggg gca tgg cag tgg age gcg teg aag aac age 
ala ala leu ala leu ala leu thr gly gly ala trp gin trp ser ala ser lys asn ser 
601/201 631/211 

egg ctg aac atg gta age gcg etc gac ccg cat teg ggc gac ate gtc aac ccc age ggg 
arg leu asn met val ser ala leu asp pro his ser gly asp ile val asn pro ser gly 

SEQ ID No.32D 
FIGURE 32D 
REPLACEMENT SHEET (RULE 26) 



# 



99/185 



661/221 691/231 

cag cat ggc gac gag aac ttc ttg etc gtc ggt atg gac tct cgt gec ggg gcg aac gec 
qln his gly asp glu asn phe leu leu val gly met asp ser arg ala gly ala asn ala 
721/241 751/251 

aat ate ggc gec ggc gac gec gag gac gec ggc ggc gca cgt teg gac acc gtc atg ctg 
asn ile gly ala gly asp ala glu asp ala gly gly ala arg ser asp thr val met leu 
781/261 811/271 

gtc aac att ccg gee age cgc gag egg gtc gtc gcg gtg teg ttc ccc cgc gac ctg gcg 
val asn ile pro ala ser arg glu arg val val ala val ser phe pro arg asp leu ala 
841/281 871/291 

ate act cca ate caa tgc gag gcg tgg aac ccc gag acc ggt aag tac gga ccc ate tac 
ile thr pro ile gin cys glu ala trp asn pro glu thr gly lys tyr gly pro ile tyr 
901/301 931/311 

gac gag aag acg gga acg atg ggt ccc aga ctg gtg tac acg gag acc aag ctg aac teg 
asp glu lys thr gly thr met gly pro arg leu val tyr thr glu thr lys leu asn ser 
961/321 991/331 

gca ttc tec ttc ggc ggg cct aag tgt eta gtg aag gtc att cag aaa ctg teg ggc ttg 
ala phe ser phe gly gly pro lys cys leu val lys val ile gin lys leu ser gly leu 
1021/341 .1051/351 

age ate aac egg ttc ate gcg att gac ttc gtc ggt ttc gcg egg atg gtc gag gee etc 
ser ile asn arg phe ile ala ile asp phe val gly phe ala arg met val glu ala leu 
1081/361 1111/371 

ggc ggc gtc gag gta tgc age acc acc ccg ttg egg gac tac gaa ctg ggc acg gtg ctg 
aly gly val glu val cys ser thr thr pro leu arg asp tyr glu leu gly thr val leu 
1141/381 1171/391 

gag cac gec gga cgc cag gtc att gac ggg ccg acc gcg ctg aac tat gtg cgc get cgc 
glu his ala gly arg gin val ile asp gly pro thr ala leu asn tyr val arg ala arg 
1201/401 1231/411 

cag gtc acc acc gag age aat ggc gac tac ggg cgc ate aaa cgc cag cag ttg ttt ttg 
gin val thr thr glu ser asn gly asp tyr gly arg ile lys arg gin gin leu phe leu 
1261/421 1291/431 

teg teg ctg ctg cgt teg atg ate teg acg gac acc ttg ttc aac etc age agg etc aac 
ser ser leu leu arg ser met ile ser thr asp thr leu phe asn leu ser arg leu asn 
1321/441 1351/451 

aac gtc gtc aac atg ttc ate ggt aac age tac gtg gac aac gtc aag acc aaa gac ctg 
asn val val asn met phe ile gly asn ser tyr val asp asn val lys thr lys asp leu 
1381/461 1411/471 

gtc gaa etc ggt cga teg ttg cag cat atg gcg gee ggg cac gtc acg ttc gtg acc gtt 
val glu leu gly arg ser leu gin his met ala ala gly his val thr phe val thr val 
1441/481 1471/491 

ccg acc ggt ata acc gac cag aac ggc gac gag ccc ccg cgt acc tec gac atg aag gcg 
pro thr gly ile thr asp gin asn gly asp glu pro pro arg thr ser asp met lys ala 
1501/501 1531/511 

ctt ttc acc gee ate ate gac gac gat ccg ctg ccc ctg gaa aac gat cac aac gee cag 
leu phe thr ala ile ile asp asp asp pro leu pro leu glu asn asp his asn ala gin 
1561/521 1591/531 

cgt ctg ggc aac acg ccg teg acc ccg ccg acc acc acc aag aag gcg ccg cag gcg ggt 
arg leu gly asn thr pro ser thr pro pro thr thr thr lys lys ala pro gin ala gly 
1621/541 . 1651/551 

ctg acc aac gag att cag cac cag cag gtt acg acg acc teg cca aaa gag gtc aca gtg 
leu thr asn glu ile gin his gin gin val thr thr thr ser pro lys glu val thr val 
1681/561 1711/571 

cag gtc tct aac teg acc ggc cag gee ggt ttg gee acc acc gec acc gat cag etc aag 
qln val ser asn ser thr gly gin ala gly leu ala thr thr ala thr asp gin leu lys 
1741/581 1771/591 

egg aac ggc ttc aac gtg atg get ccg gac gac tac ccg agt teg ctg ctg gee acc aca 
arg asn gly phe asn val met ala pro asp asp tyr pro ser ser leu leu ala thr thr 
1801/601 1831/611 

gtg ttt ttt teg ccc ggc aac gaa cag get gec gec acc gtg gee gee gtg ttc ggc cag 
val phe phe ser pro gly asn glu gin ala ala ala thr val ala ala val phe gly gin 
1861/621 1891/631 

tea aag ate gag egg gtg acc ggg ate ggc caa ctg gtc cag gtg gtg ctg ggc caa gac 
ser lys ile glu arg val thr gly ile gly gin leu val gin val val leu gly gin asp 

SEQ ID No.32D (continued 1) 
FIGURE 32D (continued 1) 



REPLACEMENT SHEET (RULE 26) 



100/185 



1921/641 

ttc age gcg gtg cgc 
phe ser ala val arg 
1981/661 

aac tec tec age cca 
asn ser ser ser pro 
2041/681 

acc ace tgc gag tag 
thr thr cys glu AMB 



get ccc ctg ccg agt 
ala pro leu pro ser 

ccg acc aag ctg ccc 
pro thr lys leu pro 



1951/651 

ggc tec acc gtc age 
gly' ser thr val ser 
2011/671 

gag gac ctg acg gtc 
glu asp leu thr val 



gtg cag ata age cgc 
val gin ile ser arg 

acc aac gec gec gac 
thr asn ala ala asp 



SEQ ID No.32D (continued 2) 
FIGURE 32D (continued 2) 



ORF according to Cole et al. (Nature 393:537-544) and containing Rv0822c 
1/1 . 31/11 

tag gac atg agt gac ggc gag age gec gcg ccg tgg gca egg etc tec gag tea gca ttc 
AMB asp met ser asp gly glu ser ala ala pro trp ala arg leu ser glu ser ala phe 
61/21 91/31 

ccc gat ggt gtt gac cga tgg ate acg gta ccg ccc gee aca tgg gtg gca gec cag ggt 
pro asp gly val asp arg trp ile thr val pro pro ala thr trp val ala ala gin gly 
121/41 151/51 

ccg egg gac acc cag aat gtc ggc tgt cat gec acc ggc gee gtt agt gtg gee gat ctg 
pro arg asp thr gin asn val gly cys his ala thr gly ala val ser val ala asp leu 
181/61 211/71 

ate gee agg etc ggc ccc get ttt cct gac etc ccc acg cac cgc cat gtc gec ccc gaa 
ile ala arg leu gly pro ala phe pro asp leu pro thr his arg his val ala pro glu 
241/81 271/91 

ccc gag cca tec ggc cgc ggc ccg aag gtc cac gac gac gee gac gac cag cag gac acc 
pro glu pro ser gly arg gly pro lys val his asp asp ala asp asp gin gin asp thr 
301/101 331/111 

gag get ate gee ate ccg gee cac teg etc gag ttc etc teg gag ctt ccc gac etc egg 
glu ala ile ala ile pro ala his ser leu glu phe leu ser glu leu pro asp leu arg 
361/121 391/131 

gca gee aac tat ccg cgc gec gac cac gec cgc cgt gaa ccc gag eta ccc ggc aag cag 
ala ala asn tyr pro arg ala asp his ala arg arg glu pro glu leu pro gly lys gin 
421/141 451/151. 

eta acc gga teg get cga gtg egg cca ttg egg ate cgc cga acg teg ccc gcg ccc gee 
leu thr gly ser ala arg val arg pro leu arg ile arg arg thr ser pro ala pro ala 
481/161 511/171 

aag cca gcg ccg aac tec ggc egg cgc ccg atg gtg ctg gee gcg cgc teg ctg gcg get 
lys pro ala pro asn ser gly arg arg pro met val leu ala ala arg ser leu ala ala 
541/181 571/191 

ctg ttt gee get ctg gcg ttg gcg ctg acc ggc ggg gca tgg cag tgg age gcg teg aag 
leu phe ala ala leu ala leu ala leu thr gly gly ala trp gin trp ser ala ser lys 
601/201 631/211 

aac age egg ctg aac atg gta age gcg etc gac ccg cat teg ggc gac ate gtc aac ccc 
asn ser arg leu asn met val ser ala leu asp pro his ser gly asp ile val asn pro 
661/221 691/231 

age ggg cag cat ggc gac gag aac ttc ttg etc gtc ggt atg gac tct cgt gec ggg gcg 
ser gly gin his gly asp glu asn phe leu leu val gly met asp ser arg ala gly ala 
721/241 751/251 

aac gee aat ate ggc gee ggc gac gec gag gac gee ggc ggc gca cgt teg gac acc gtc 
asn ala asn ile gly ala gly asp ala glu asp ala gly gly ala arg ser asp thr val 
781/261 811/271 

atg ctg gtc aac att ccg gee age cgc gag egg gtc gtc gcg gtg teg ttc ccc cgc gac 
met leu val asn ile pro ala ser arg glu arg val val ala val ser phe pro arg asp 



SEQ ID No.32F 



FIGURE 32F 
REPLACEMENT SHEET (RULE 26) 



101/185 

841/281 ■ 871/291 

ctg gcg ate act cca ate caa tgc gag gcg tgg aac ccc gag acc ggt aag tac gga ccc 
leu ala ile thr pro ile gin cys glu ala trp asn pro glu thr gly lys tyr gly pro 
901/301 931/311 

ate tac gac gag aag acg gga acg atg ggt ccc aga ctg gtg tac acg gag acc aag ctg 
ile tyr asp glu lys thr gly thr met gly pro arg leu val tyr thr glu thr lys leu 
961/321 991/331 

aac teg gca ttc tec ttc ggc ggg cct aag tgt eta gtg aag gtc att cag aaa ctg teg 
asn ser ala phe ser phe gly gly pro lys cys leu val lys val ile gin lys leu ser 
1021/341 1051/351 

ggc ttg age ate aac egg ttc ate gcg att gac ttc gtc ggt ttc gcg egg atg gtc gag 
gly leu ser ile asn arg phe ile ala ile asp phe val gly phe ala arg met val glu 
1081/361 1111/371 

gec etc ggc ggc gtc gag gta tgc age acc acc ccg ttg egg gac tac gaa ctg ggc acg 
ala leu gly gly val glu val cys ser thr thr pro leu arg asp tyr glu leu gly thr 
1141/381 1171/391 

gtg ctg gag cac gee gga cgc cag gtc att gac ggg ccg acc gcg ctg aac tat gtg cgc 
val leu glu his ala gly arg gin val ile asp gly pro thr ala leu asn tyr val arg 
1201/401 . 1231/411 

get cgc cag gtc acc acc gag age aat ggc gac tac ggg cgc ate aaa cgc cag cag ttg 
ala arg gin val thr thr glu ser asn gly asp tyr gly arg ile lys arg gin gin leu 
1261/421 1291/431 

ttt ttg teg teg ctg ctg cgt teg atg ate teg acg gac acc ttg ttc aac etc age agg 
phe leu ser ser leu leu arg ser met ile ser thr asp thr leu phe asn leu ser arg 
1321/441 1351/451 

etc aac aac gtc gtc aac atg ttc ate ggt aac age tac gtg gac aac gtc aag acc aaa 
leu asn asn val val asn met phe ile gly asn ser tyr val asp asn val lys thr lys 
1381/461 1411/471 

gac ctg gtc gaa etc ggt cga teg ttg cag cat atg gcg gee ggg cac gtc acg ttc gtg 
asp leu val glu leu gly arg ser leu gin his met ala ala gly his val thr phe val 
1441/481 " 1471/491 

acc gtt ccg acc ggt ata acc gac cag aac ggc gac gag ccc ccg cgt acc tec gac atg 
thr val pro thr gly ile thr asp gin asn gly asp glu pro pro arg thr ser asp met 
1501/501 1531/511 

aag gcg ctt ttc acc gee ate ate gac gac gat ccg ctg ccc ctg gaa aac gat cac aac 
lys ala leu phe thr ala ile ile asp asp asp pro leu pro leu glu asn asp his asn 
1561/521 1591/531 

gec cag cgt ctg ggc aac acg ccg teg acc ccg ccg acc acc acc aag aag gcg ccg cag 
ala gin arg leu gly asn thr pro ser thr pro pro thr thr thr lys lys ala pro gin 
1621/541 1651/551 

gcg ggt ctg acc aac gag att cag cac cag cag gtt acg acg acc teg cca aaa gag gtc 
ala gly leu thr asn glu ile gin his gin gin val thr thr thr ser pro lys glu val 
1681/561 1711/571 

aca gtg cag gtc tct aac teg acc ggc cag gee ggt ttg gec acc acc gec acc gat cag 
thr val gin val ser asn ser thr gly gin ala gly leu ala thr thr ala thr asp gin 
1741/581 1771/591 

etc aag egg aac ggc ttc aac gtg atg get ccg gac gac tac ccg agt teg ctg ctg gee 
leu lys arg asn gly phe asn val met ala pro asp asp tyr pro ser ser leu leu ala 
1801/601 1831/611 

acc aca gtg ttt ttt teg ccc ggc aac gaa cag get gec gee acc gtg gee gee gtg ttc 
thr thr val phe phe ser pro gly asn glu gin ala ala ala thr val ala ala val phe 
1861/621 1891/631 

ggc cag tea aag ate gag egg gtg acc ggg ate ggc caa ctg gtc cag gtg gtg ctg ggc 
gly gin ser lys ile glu arg val thr gly ile gly gin leu val gin val val leu gly 
1921/641 1951/651 

caa gac ttc age gcg gtg cgc get ccc ctg ccg agt ggc tec acc gtc age gtg cag ata 
gin asp phe ser ala val arg ala pro leu pro ser gly ser thr val ser val gin ile 
1981/661 2011/671 

age cgc aac tec tec age cca ccg acc aag ctg ccc gag gac ctg acg gtc acc aac gec 
ser arg asn ser ser ser pro pro thr lys leu pro glu asp leu thr val thr asn ala 
2041/681 

gec gac acc acc tgc gag tag 
ala asp thr thr cys glu AMB 

SEQ ID 32F (continued 1) 
FIGURE 32 F (continued 1) 



REPLACEMENT SHEET (RULE 26) 



• 



102/185 



1/1 

CGT CAC CTC TGC CAT GGT CCA 
arg his leu cys his gly pro 
61/21 

CAT GCA GAG TCG GTG TTC GCT 
his ala glu ser val phe ala 
121/41 

GGC CGA CAT TCG CCC GAG GCC 
gly arg his ser pro glu ala 
181/61 

ATT GAT ACG ATT GCG CAC ATG 
ile asp thr ile ala his met 



31/11 

TCT ACG GTA TCT GCG ACA AGG 
ser thr val ser ala thr arg 
91/31 

TCA CGC GAA CTA GGC GCG CCT 
ser arg glu leu gly ala pro 
151/51 

TTG GCC TCC ATC ACC TAA TTG 
leu ala ser ile thr OCH leu 

211/71 
GCT ATC TGG GAT C 
ala ile trp asp 



GCA GCG TCG ATC CCT CGA 
ala ala ser ile pro arg 

AGC CTG GAC GAG TCC CCG 
ser leu asp glu ser pro 

TGT GCA AAA CCG TAT CTA 
cys ala lys pro tyr leu 



SEQ ID No.33A 



FIGURE 33A 



1/1 

GTC ACC TCT GCC ATG GTC CAT 
val thr ser ala met val his 
61/21 

ATG CAG AGT CGG TGT TCG CTT 
met gin ser arg cys ser leu 
121/41 

GCC GAC ATT CGC CCG AGG CCT 
ala asp ile arg pro arg pro 
181/61 

TTG ATA CGA TTG CGC ACA TGG 
leu ile arg leu arg thr trp 









31/11 




CTA 


CGG 


TAT 


CTG CGA 


CAA 


leu 


arg 


tyr 


leu arg 


gin 








91/31 




CAC 


GCG 


AAC 


TAG GCG 


CGC 


his 


ala 


asn 


AMB ala 


arg 








151/51 




TGG 


CCT 


CCA 


TCA CCT 


AAT 


trp 


pro 


pro 


ser pro 


asn 








211/71 




CTA 


TCT 


GGG 


ATC 




leu 


ser 


gly 


ile 





SEQ ID No.33B 
FIGURE 33B 



GGG CAG CGT CGA TCC CTC GAC 
gly gin arg arg ser leu asp 

CTA GCC TGG ACG AGT CCC CGG 
leu ala trp thr ser pro arg 

TGT GTG CAA AAC CGT ATC TAA 
cys val gin asn arg ile OCH 



1/1 

CCG TCA CCT CTG CCA TGG TCC 
pro ser pro leu pro trp ser 
61/21 

ACA TGC AGA GTC GGT GTT CGC 
thr cys arg val gly val arg 
121/41 

GGG CCG ACA TTC GCC CGA GGC 
gly pro thr phe ala arg gly 
181/61 

AAT TGA TAC GAT TGC GCA CAT 
asn OPA tyr asp cys ala his 



31/11 



ATC 


TAC 


GGT 


ATC TGC 


GAC 


AAG 


ile 


tyr 


gly 


ile cys 


asp 


lys 








91/31 






TTC 


ACG 


CGA 


ACT AGG 


CGC 


GCC 


phe 


thr 


arg 


thr arg 


arg 


ala 








151/51 






CTT 


GGC 


CTC 


CAT CAC 


CTA 


ATT 


leu 


gly 


leu 


his his 


leu 


ile 








211/71 






GGC 


TAT 


CTG 


GGA TC 






gly 


tyr 


leu 


gly 







GGC AGC GTC GAT CCC TCG 
gly ser val asp pro ser 

TAG CCT GGA CGA GTC CCC 
AMB pro gly arg val pro 

GTG TGC AAA ACC GTA TCT 
val cys lys thr val ser 



SEQ ID No.33C 



FIGURE 33C 



REPLACEMENT SHEET (RULE 2€) 



103/185 



sequence Rvl044 predicted 
seq33A 

1/1 

ttg tgt gca aaa ccg tat eta 
leu cys ala lys pro tyr leu 
61/21 

gtc gag gtt gee gee gag caa 
val glu val ala ala glu gin 
121/41 

gtc gac cct gtg cag etc cgc 
val asp pro val gin leu arg 
181/61 

ggt gtg tac egg gtg ccc gtg 
gly val tyr arg val pro val 
241/81 

teg tgg act ttg ggg cgt ggc 
ser trp thr leu gly arg gly 
301/101 

get gac gtg aac ccg teg cgc 
ala asp val asn pro ser arg 
361/121 

gee ggg ggc gag ctg tac cga 
ala gly gly glu leu tyr arg 
421/141 

gtc gac gga ata ccc gtc acg 
val asp gly ile pro val thr 
481/161 

acg gat cct tat cag ctt egg 
thr asp pro tyr gin leu arg 
541/181 

cgt ggg tea gca get gag eta 
arg gly ser ala ala glu leu 
601/201 

egg ccg aag cga gca teg gcg 
arg pro lys arg ala ser ala 



by Cole et al. (Nature 



31/11 

att gat acg att gcg cac 
ile asp thr ile ala his 
91/31 

cat ggc tac gtc acg act 
his gly tyr val thr thr 
151/51 

etc eta gcg ggg cgc gga 
leu leu ala gly arg gly 

211/71 
ctg ccg cgt ggt gag cac 
leu pro arg gly glu his 

271/91 

gtt ate teg cat gag teg 
val ile ser his glu ser 

331/111 
ate cat etc ace gtc ccg 
ile his leu thr val. pro 

391/131 
gtt cac cgc cgc gac etc 
val his arg arg asp leu 

451/151 
acg gtt gcg cgc ace ate 
thr val ala arg thr ile 

511/171 
gee gcg ate gag cga gee 
ala ala ile glu arg ala 

571/191 
cgc get gcg etc gat gag 
arg ala ala leu asp glu 

tga 
OPA 



393:537-544) and containing 



atg get ate tgg gat cgc etc 
met ala ile trp asp arg leu 

cgc gat gcg cga gac ate ggc 
arg asp ala arg asp ile gly 

cgt ctt gag cgt gtc ggc cga 
arg leu glu arg val gly arg 

gac gat etc gca gee gca gtg 
asp asp leu ala ala ala val 

gee ttg gcg ctt cat gee etc 
ala leu ala leu his ala leu 

cgc aac aac cat ccg cgt gcg 
arg asn asn his pro arg ala 

cag gca gee cac gtc act teg 
gin ala ala his val thr ser 

aaa gac tgc gtg aag acg ggc 
lys asp cys val lys thr gly 

gaa gee gag ggc acg ctt cgt 
glu ala glu gly thr leu arg 

ace act gee gga tta cgc get 
thr thr ala gly leu arg ala 



SEQ ID No.33D 
FIGURE 33D 



REPLACEMENT SHEET (RULE 26) 



104/185 



ORF according to Cole et al. (Nature 393:537-544) and containing Rvl044 
1/1 31/11 

taa ttg tgt gca aaa ccg tat eta att gat acg att gcg cac atg get ate tgg gat cgc 
OCH leu cys ala lys pro tyr leu ile asp thr ile ala his met ala ile trp asp arg 
61/21 91/31 

etc gtc gag gtt gee gee gag caa cat ggc tac gtc acg act cgc gat gcg cga gac ate, 
leu val glu val ala ala glu gin his gly tyr val thr thr arg asp ala arg asp ile 
121/41 151/51 

ggc gtc gac cct gtg cag etc cgc etc eta gcg ggg cgc gga cgt ctt gag cgt gtc ggc 
gly val asp pro val gin leu arg leu leu ala gly arg gly arg leu glu arg val gly 
181/61 . 211/71 

cga ggt gtg tac egg gtg ccc gtg ctg ccg cgt ggt gag cac gac gat etc gca gee gca 
arg gly val tyr arg val pro val leu pro arg gly glu his asp asp leu ala ala ala 
241/81 271/91 

gtg teg tgg act ttg ggg cgt ggc gtt ate teg cat gag teg gee ttg gcg ctt cat gec 
val ser trp thr leu gly arg gly val ile ser his glu ser ala leu ala leu his ala 
301/101 331/111 

etc get gac gtg aac ccg teg cgc ate cat etc acc gtc ccg cgc aac aac cat ccg cgt 
leu ala asp val asn pro ser arg ile his leu thr val pro arg asn asn his pro arg 
361/121 391/131 

gcg gee ggg ggc gag ctg tac cga gtt cac cgc cgc gac etc cag gca gee cac gtc act 
ala ala gly gly glu leu tyr arg val his arg arg asp leu gin ala ala his val thr 
421/141 451/151 

teg gtc gac gga ata ccc gtc acg acg gtt gcg cgc acc ate aaa gac tgc gtg aag acg 
ser val asp gly ile pro val thr thr val ala arg thr ile lys asp cys val lys thr 
481/161 511/171 

ggc acg gat cct tat cag ctt egg gee gcg ate gag cga gee gaa gee gag ggc acg ctt 
gly thr asp pro tyr gin leu arg ala ala ile glu arg ala glu ala glu gly thr leu 
541/181 571/191 

cgt cgt ggg tea gca get gag eta cgc get gcg etc gat gag acc act gee gga tta cgc 
arg arg gly ser ala ala glu leu arg ala ala leu asp glu thr thr ala gly leu arg 
601/201 

get egg ccg aag cga gca teg gcg tga 
ala arg pro lys arg ala ser ala OPA 

SEQ ID No.33F 
FIGURE 33F 



■ 1/1 

ATC CAA CCT GCT GGG CCT GCG 
ile gin pro ala gly pro ala 
61/21 

ACA ACA CCT GGA ATG GGG ACC 
thr thr pro gly met gly thr 
121/41 

CCT CGG TCG AGA CGT ATC GCG 
pro arg ser arg arg ile ala 
181/61 

CGC CGC ATA TTT GCC GCG CTG 
arg arg ile phe ala ala leu 









31/11 




CCT 


TCG 


AAT 


CGA CGG 


CCA 


pro 


ser 


asn 


arg arg 


pro 








91/31 




TTT 


TCG 


GTG 


TTG CTG 


GTA 


phe 


ser 


val 


leu leu 


val 






151/51 




GCA 


GCG 


TTG 


GCC CTG 


TCG 


ala 


ala 


leu 


ala leu 


ser 








211/71 




CCG 


CGG 


GCC 


GGA TC 




pro 


arg 


ala 


gly 





SEQ ID No.34A 
FIGURE 34A 



GGC CAC CGC TCG CTG CCG GCA 
gly his arg ser leu pro ala 

ACC GGG ACA ACC GGC ACC ACG 
thr gly thr thr gly thr thr 

TTG CTG ACA ATT ACC GCT GGC 
leu leu thr ile thr ala gly 



REPLACEMENT SHEET (RULE 26) 




105/185 



1/1 


















31/11 


















TCC AAC 


CTG 


CTG 


GGC 


CTG 


CGC 


CTT 


CGA 


ATC 


GAC GGC 


CAG 


GCC 


ACC 


GCT 


CGC 


TGC 


CGG 


CAA 


ser asn 


leu 


leu 


gly 


leu 


arg 


leu 


arg 


ile 


asp gly 


gin 


ala 


thr 


ala 


arg 


cys 


arg 


gin 


61/21 


















91/31 


















CAA CAC 


CTG 


GAA 


TGG 


GGA 


CCT 


TTT 


CGG 


TGT 


TGC TGG 


TAA 


CCG 


GGA 


CAA 


CCG 


GCA 


CCA 


CGC 


gin his 


leu 


glu 


trp 


gly 


pro 


phe 


arg 


cys 


cys trp 


OCH 


pro 


gly 


gin 


pro 


ala 


pro 


arg 


121/41 


















151/51 


















CTC GGT 


CGA 


GAC 


GTA 


TCG 


CGG 


CAG 


CGT 


TGG 


CCC TGT 


CGT 


TGC 


TGA 


CAA 


TTA 


CCG 


CTG 


GCC 


leu gly 


arg 


asp 


val 


ser 


arg 


gin 


arg 


trp 


pro cys 


arg 


cys 


OPA gin 


leu 


pro 


leu 


ala 


181/61 
















211/71 


















GCC GCA 


TAT 


TTG 


CCG 


CGC 


TGC 


CGC 


GGG 


CCG 


GAT C 


















ala ala 


tyr 


leu 


pro 


arg 


cys 


arg 


gly 


pro 


asp 



















SEQ ID No.34B 
FIGURE 34B 



1/1 

GAT CCA ACC TGC 
asp pro thr cys 
61/21 

AAC AAC ACC TGG 
asn asn thr trp 
121/41 

GCC TCG GTC GAG 
ala ser val glu 
181/61 

CCG CCG CAT ATT 
pro pro his ile 



TGG GCC TGC GCC 
trp ala cys ala 

AAT GGG GAC CTT 
asn gly asp leu 

ACG TAT CGC GGC 
thr tyr arg gly 

TGC CGC GCT GCC 
cys arg ala ala 







31/11 


TTC 


GAA 


TCG ACG 


phe 


glu 


ser thr 






91/31 


TTC 


GGT 


GTT GCT 


phe 


gly 


val ala 






151/51 


AGC 


GTT 


GGC CCT 


ser 


val 


gly pro 






211/71 


GCG 


GGC 


CGG ATC 


ala 


gly 


arg ile 



SEQ ID No.34C 
FIGURE 34C 



GCC AGG CCA CCG 

ala arg pro pro 

GGT AAC CGG GAC 

gly asn arg asp 

GTC GTT GCT GAC 

val val ala asp 



CTC GCT GCC GGC 
leu ala ala gly 

AAC CGG CAC CAC 
asn arg his his 

AAT TAC CGC TGG 
asn tyr arg trp 



REPLACEMENT SHEET (RULE 26) 




106/185 

ORF according to Cole et al. (Nature 393:537-544) containing seq34A 
1/1 31/11 

tag ccg cag ggc cct gcg get agg cgc ggc egg tgc cgt tgg ccg egg egg caa teg atg 
AMB pro gin gly pro ala ala arg arg gly arg cys arg trp pro arg arg gin ser met 
61/21 91/31 

ttg cag cag tta caa cgc caa atg gag tct gag cgc ate gtc gag ttc gat cag etc ggc 
leu gin gin leu gin arg gin met glu ser glu arg ile val glu phe asp gin leu gly 
121/41 151/51 

agg gga gac gtt gcg cag cga egg ate caa cct get ggg cct gcg cct teg aat cga egg 
arg gly asp val ala gin arg arg ile gin pro ala gly pro ala pro ser asn arg arg 
181/61 211/71 

cca ggc cac cgc teg ctg ccg gca aca aca cct gga atg ggg ace ttt teg gtg ttg ctg 
pro gly his arg ser leu pro ala thr thr pro gly met gly thr phe ser val leu leu 
241/81 271/91 

gta ace ggg aca ace ggc ace acg cct egg teg aga cgt ate gcg gca gcg ttg gee ctg 
val thr gly thr thr gly thr thr pro arg ser arg arg ile ala ala ala leu ala leu 
301/101 331/111 

teg ttg ctg aca att ace get ggc cgc cgc ata ttt gee gcg ctg ccg egg gee gga tec 
ser leu leu thr ile thr ala gly arg arg ile phe ala ala leu pro arg ala gly ser 
361/121 391/131 

agg teg ace tgc cag ate tea ccg cgc age ate tac gee gtt cgc tgc aaa ccg ccg act 
arg ser thr cys gin ile ser pro arg ser ile tyr ala val arg cys lys pro pro thr 
421/141 451/151 

gcg acg gca ggc cca etc tct tgg cat gcg tec aat get gcg acg tec teg gta gac aag 
ala thr ala gly pro leu ser trp his ala ser asn ala ala thr ser ser val asp lys 
481/161 511/171 

etc acg ctt ggc ttc atg ccg cag tec tac cca tgt agt aac aga tag 
leu thr leu gly phe met pro gin ser tyr pro cys ser asn arg AMB 

SEQ ID No.34F 
FIGURE 34F 

1/1 31/11 
CAG TCT GTC GGC AAG GAG GGA CGC A' 
gin ser val gly lys glu gly arg im 
61/21 

CAG ATC GAG AGC GCT CTC TAC GCC G 
gin ile glu ser ala leu tyr ala g 
121/41 

GGC TTC CGC GCA CCG ACC GCG CGG C 
gly phe arg ala pro thr ala arg a 
181/61 

CTG GGG ATG TTG GTT TCC GGC GTG G 
leu gly met leu val ser gly val a 
241/81 

CTC AGC GTT TTC GGT TTT GTC GTG A 
leu ser val phe gly phe val val m 
301/101 

CGG TTG TCC GGC AGG ATG GAT CGT G 
arg leu ser gly arg met asp arg g 
361/121 



SEQ ID NO-35A 



CCA 


CTC TCC 


GAT 


CAT 


GAG 


CAG 


CGG 


ATG 


CTT 


GAC 


pro 


leu ser 
91/31 


asp 


his 


glu 


gin 


arg 


met 


leu 


asp 


GAT 


CCC AAG 


TTC 


GCA 


TCG 


AGT 


GTC 


CGT 


GGC 


GGG 


asp 


pro lys 
151/51 


phe 


ala 


ser 


ser 


val 


arg 


gly 


giy 


CGC 


CTG CAG 


GGC 


GCG 


GCG 


TTG 


TTC 


ATC 


ATC 


GGT 


arg 


leu gin 
211/71 


gly 


ala 


ala 


leu 


phe 


ile 


ile 


gly 


TTC 


AAA GAG 


ACC 


ATG 


ATC 


GGA 


AGT 


TTC 


CCG 


ATA 


phe 


lys glu 
271/91 


thr 


met 


ile 


gly 


ser 


phe 


pro 


ile 


TTC 


GGT GGT 


GTG 


GTG 


TAT 


GCC 


ATC 


ACC 


GGT 


CCT 


phe 


gly gly 
331/111 


val 


val 


tyr 


ala 


ile 


thr 


giy 


pro 


GGA 


TCG GCT 


GCT 


GGG 


GCT 


TCG 


CGC 


CAG 


CGT 


CGT 


gly 


ser ala 


ala 


giy 


ala 


ser 


arg 


gin 


arg 


arg 


391/131 


















AGC 


CGT ATG 


GAA 


GAT 


C 












ser 


arg met 


glu 


asp 















FIGURE 35A 
REPLACEMENT SHEET (RULE 26) 



107/185 



1/1 31/11 

GAC AGT CTG TCG GCA AGG AGG GAC GCA TGC CAC TCT CCG ATC ATG AGC AGC GGA TGC TTG 
asp ser leu ser ala arg arg asp ala cys his ser pro ile met ser ser gly cys leu 
61/21 91/31 

ACC AGA TCG AGA GCG CTC TCT ACG CCG AAG ATC CCA AGT .TCG CAT CGA GTG TCC GTG GCG 
thr arg ser arg ala leu ser thr pro lys ile pro ser ser his arg val ser val ala 
121/41 151/51 

GGG GCT TCC GCG CAC CGA CCG CGC GGC GGC GCC TGC AGG GCG CGG CGT TGT TCA TCA TCG 
gly ala ser ala his arg pro arg gly gly ala cys arg ala arg arg cys ser ser ser 
181/61 211/71 

GTC TGG GGA TGT TGG TTT CCG GCG TGG CGT TCA AAG AGA CCA TGA TCG GAA GTT TCC CGA 
val trp gly cys trp phe pro ala trp arg ser lys arg pro OPA ser glu val ser arg 
241/81 271/91 

TAC TCA GCG TTT TCG GTT TTG TCG TGA TGT TCG GTG GTG TGG TGT ATG CCA TCA CCG GTC 
tyr ser ala phe ser val leu ser OPA cys ser val val trp cys met pro ser pro val 
301/101 331/111 

CTC GGT TGT CCG GCA GGA TGG ATC GTG GCG GAT CGG CTG CTG GGG CTT CGC GCC AGC GTC 

leu gly cys pro ala gly trp ile val ala asp arg leu leu gly leu arg ala ser val 

361/121 391/131 

GTA CCA AGG GGG CCG GGG GCT CAT TCA CCA GCC GTA TGG AAG ATC 

val pro arg gly pro gly ala his ser pro ala val trp lys ile 



SEQ ID No.35B 
FIGURE 35B 



1/1 

ACA GTC TGT CGG CAA GGA 
thr val cys arg gin gly 
61/21 

CCA GAT CGA GAG CGC TCT 
pro asp arg glu arg ser 
121/41 

GGG CTT CCG CGC ACC GAC 
gly leu pro arg thr asp 
181/61 

TCT GGG GAT GTT GGT TTC 
ser gly asp val gly phe 
241/81 

ACT CAG CGT TTT CGG TTT 
thr gin arg phe arg phe 
301/101 

TCG GTT GTC CGG CAG GAT 
ser val val arg gin asp 
361/121 

TAC CAA GGG GGC CGG GGG 
tyr gin gly gly arg gly 











31/11 




GGG 


ACG 


CAT 


GCC 


ACT CTC 


CGA 


gly 


thr 


his 


ala 


thr leu 


arg 










91/31 




CTA 


CGC 


CGA 


AGA 


TCC CAA 


GTT 


leu 


arg 


arg 


arg 


ser gin 


val 










151/51 




CGC 


GCG 


GCG 


GCG 


CCT GCA 


GGG 


arg 


ala 


ala 


ala 


pro ala 


gly 










211/71 




CGG 


CGT 


GGC 


GTT 


CAA AGA 


GAC 


arg 


arg 


gly 


val 


gin arg 


asp 










271/91 




TGT 


CGT 


GAT 


GTT 


CGG TGG 


TGT 


cys 


arg 


asp 


val 


arg trp 


cys 










331/111 




GGA 


TCG 


TGG 


CGG 


ATC GGC 


TGC 


gly 


ser 


trp 


arg 


ile gly 


cys 










391/131 




CTC 


ATT 


CAC 


CAG 


CCG TAT 


GGA 


leu 


ile 


his 


gin 


pro tyr 


gly 



TCA TGA GCA GCG GAT GCT TGA 
ser OPA ala ala asp ala OPA 

CGC ATC GAG TGT CCG TGG CGG 
arg ile glu cys pro trp arg 

CGC GGC GTT GTT CAT CAT CGG 
arg gly val val his his arg 

CAT GAT CGG AAG TTT CCC GAT 
his asp arg lys phe pro asp 

GGT GTA TGC CAT CAC CGG TCC 
gly val cys his his arg ser 

TGG GGC TTC GCG CCA GCG TCG 
trp gly phe ala pro ala ser 

AGA TC 
arg 



SEQ ID No.35C 
FIGURE 35C 
REPLACEMENT SHEET (RULE 26) 



V 



108/185 



sequence Rv2169c predicted by Cole et 
containing seq35A 



1/1 







etc 


tec 


gat 


cat 


craq 


cag 


egg 


atg 






leu. 


ser 


asp 


his 


glu 


gin 


arg 


met 




61/21 




















y da y «■ *— 


ccc 


aag 


ttc 


aca 


teg 


agt 


gtc 


cgt 




y J. Li c. o ^> 


pro 


1 vs 


phe 


ala 


ser 


ser 


val 


arg 




121/41 




















egg cgc 


ctg 


cag 


ggc 


gcg 


gcg 


ttg 


ttc 


ate 




arg arg 


leu 


gin 


gly 


ala 


ala 


leu 


phe 


ile 




181/61 




















gcg ttc 


aaa 


gag 


acc 


atg 


ate 


gga 


agt 


ttc 




ala phe 


lys 


glu 


thr 


met 


ile 


gly 


ser 


phe 




241/81 




















atg ttc 


ggt 


ggt 


gtg 


gtg 


tat 


gee 


ate 


acc 




met phe 


gly 


gly 


val 


val 


tyr 


ala 


ile 


thr 




301/101 




















ggc gga 


teg 


get 


get 


ggg 


get 


teg 


cgc 


cag 




gly gly 


ser 


ala 


ala 


gly 


ala 


ser 


arg 


gin 




361/121 




















acc age 


cgt 


atg 


gaa 


gat 


egg 


ttc 


egg 


cgc 




thr ser 


arg 


met 


glu 


asp 


arg 


phe 


arg 


arg 



SEQ 



al. (Nature 393:537-544) and partially 



31/11 

ctt gac cag ate gag age get etc tac gec 
leu asp gin ile glu ser ala leu tyr ala 
91/31 

ggc ggg ggc ttc cgc gca ccg acc gcg egg 
gly gly gly phe arg ala pro thr ala arg 
151/51 

ate ggt ctg ggg atg ttg gtt tec ggc gtg 
ile gly leu gly met leu val ser gly val 
211/71 

ccg ata etc age gtt ttc ggt ttt gtc gtg 
pro ile leu ser val phe gly phe val val 
271/91 

ggt cct egg ttg tec ggc agg atg gat cgt 
gly pro arg leu ser gly arg met asp arg 
331/111 

cgt cgt acc aag ggg gee ggg ggc tea ttc 
arg arg thr lys gly ala gly gly ser phe 
391/131 

cgc ttc gac gag taa 
arg phe asp glu OCH 



No.35D 



FIGURE 35D 



ORF according to Cole et al. (Nature 393:537-544) and containing Rv2169c 

1/1 31/11 
tga cag tct gtc ggc aag gag gga cgc atg cca etc tec 
OPA gin ser val gly lys glu gly arg met pro leu ser 
61/21 91/31 
gac cag ate gag age get etc tac gee gaa gat ccc aag 
asp gin ile glu ser ala leu tyr ala glu asp pro lys 
121/41 151/51 
ggg ggc ttc cgc gca ccg acc gcg egg egg cgc ctg cag 
qly gly phe arg ala pro thr ala arg arg arg leu gin 
181/61 211/71 
ggt ctg ggg atg ttg gtt tec ggc gtg gcg ttc aaa gag 
gly leu gly met leu val ser gly val ala phe lys glu 
241/81 271/91 
ata etc age gtt ttc ggt ttt gtc gtg atg ttc ggt ggt 
ile leu ser val phe gly phe val val met phe gly gly 
301/101 331/111 
cct egg ttg tec ggc agg atg gat cgt ggc gga teg get 
pro arg leu ser gly arg met asp arg gly gly ser ala 
361/121 391/131 



gat 
asp 


cat 
his 


gag 
glu 


cag 
gin 


egg 
arg 


atg 
met 


ctt 
leu 


ttc 
phe 


gca 
ala 


teg 
ser 


agt 
ser 


gtc 
val 


cgt 
arg 


ggc 
gly 


ggc 
gly 


gcg 
ala 


gcg 
ala 


ttg 
leu 


ttc 
phe 


ate 
ile 


ate 
ile 


acc 
thr 


atg 
met 


ate 
ile 


gga 
gly 


agt 
ser 


ttc 
phe 


ccg 
pro 


gtg 
val 


gtg 
val 


tat 
tyr 


gee 
ala 


ate 
ile 


acc 
thr 


ggt 
gly 


get 
ala 


ggg 

gly 


get 
ala 


teg 
ser 


cgc 
arg 


cag 
gin 


cgt 
arg 


gaa 
glu 


gat 
asp 


egg 
arg 


ttc 
phe 


egg 
arg 


cgc 
arg 


cgc 
arg 



421/141 

ttc gac gag taa 
phe asp glu OCH 

SEQ ID 35F 



FIGURE 35F 
REPLACEMENT SHEET (RULE 26) 



• 

109/185 



1/1 

GAC CTG GGA CGA AGA CGA CGG 
asp leu gly arg arg arg arg 
61/21 

CAA TGG ACA CCC GAC TAC GGT 
gin trp thr pro asp tyr gly 
121/41 

GTG GTC GGA GTG CCA CCC GAC 
val val gly val pro pro asp 
181/61 

ACG GCG CCG GTC TGG ATC ACC 
thr ala pro val trp ile thr 
241/81 

GGG GCA CCC GGT GGC ACC GTT 
gly ala pro gly gly thr val 
301/101 

GCC AAC GGC GGC CGG. CTG GAC 
ala asn gly gly arg leu asp 
361/121 

GTC GAC CCC GAC GGC GCC GTC 
val asp pro asp gly ala val 



31/11 



CAG 


CAG 


CCG 


CAA TCA 


GAT 


gin 


gin 


pro 


gin ser 


-* tr 








91/31 




GCG 


CCT 


GCG 


CGG CTC 


GAC 


ala 


pro 


ala 


arg leu 


asp 

U. W £T 








151/51 




CAG 


GCC 


ACC 


GAC TTC 


GGC 


gin 


ala 


thr 


asp phe 


gly 






211/71 




ATG 


CTG 


TGG 


CCG CTG 


GCC 


met 


leu 


trp 


pro leu 


ala 








271/91 




CCC 


GTC 


CGG 


CTG GTC 


GAC 


pro 


val 


arg 


leu val 


asp 








331/111 




ATC 


CTC 


CTG 


TCG GCG 


GCC 


ile 


leu 


leu 


ser ala 


ala 








391/131 




GGC 


CGA 


GCG 


CTG TGC 


CTG 


gly 


arg 


ala 


leu cys 


leu 



SEQ ID No.36A 
FIGURE 36A 



CTA CCC GGT CCT GGT CAA CGT 
leu pro gly pro gly gin arg 

AAT GCG CGG TTC CTG TTG CCC 
asn ala arg phe leu leu pro 

TCC GCT GTT GCA CCA GAA ACG 
ser ala val ala pro glu thr 

GAC CGG CCC CGG TTG GCC CCC 
asp arg pro arg leu ala pro 

GAC GAC CTG GCA AAC TCG CTG 
asp asp leu ala asn ser leu 

GAG TTC GCC ACC AAC CGG GAA 
glu phe ala thr asn arg glu 

GCC ATC GAC CCA GAT C 
ala ile asp pro asp 



1/1 

ACC TGG GAC GAA GAC 
thr trp asp glu asp 
61/21 

AAT GGA CAC CCG ACT 
asn gly his pro thr 
121/41 

TGG TCG GAG TGC CAC 
trp ser glu cys his 
181/61 

CGG CGC CGG TCT GGA 
arg arg arg ser gly 
241/81 

GGG CAC CCG GTG GCA 
gly his pro val ala 
301/101 
^CCA ACG GCG GCC GGC 
pro thr ala ala gly 
361/121 

TCG ACC CCG ACG GCG 
ser thr pro thr ala 



GAC GGC AGC AGC CGC 
asp gly ser ser arg 

ACG GTG CGC CTG CGC 
thr val arg leu arg 

CCG ACC AGG CCA CCG 
pro thr arg pro pro 

TCA CCA TGC TGT GGC 
ser pro cys cys gly 

CCG TTC CCG TCC GGC 
pro phe pro ser gly 

TGG ACA TCC TCC TGT 
trp thr ser ser cys 

CCG TCG GCC GAG CGC 
pro ser ala glu arg 



31/11 

AAT CAG ATC TAC CCG 
asn gin ile tyr pro 
91/31 

GGC TCG ACA ATG CGC 
gly ser thr met arg 
151/51 

ACT TCG GCT CCG CTG 
thr ser ala pro leu 
211/71 

CGC TGG CCG ACC GGC 
arg trp pro thr gly 
271/91 

TGG TCG ACG ACG ACC 
trp ser thr thr thr 
331/111 

CGG CGG CCG AGT TCG 
arg arg pro ser ser 
391/131 

TGT GCC TGG CCA TCG 
cys ala trp pro ser 



GTC CTG GTC AAC GTC 
val leu val asn val 

GGT TCC TGT TGC CCG 
gly ser cys cys pro 

TTG CAC CAG AAA CGA 
leu his gin lys arg 

CCC GGT TGG CCC CCG 
pro gly trp pro pro 

TGG CAA ACT CGC TGG 
trp gin thr arg trp 

CCA CCA ACC GGG AAG 
pro pro thr gly lys 

ACC CAG ATC 
thr gin ile 



SEQ ID No.36B 
FIGURE 36B 
REPLACEMENT SHEET (RULE 26) 



1 



110/185 



1/1 

CCT GGG ACG AAG ACG ACG GCA 
pro gly thr lys thr thr ala 
61/21 

ATG GAC ACC CGA CTA CGG TGC 
met asp thr arg leu arg cys 
121/41 

GGT CGG AGT GCC ACC CGA CCA 
gly arg ser ala thr arg pro 
181/61 

GGC GCC GGT CTG GAT CAC CAT 
gly ala gly leu asp his his 
241/81 

GGC ACC CGG TGG CAC CGT TCC 
gly thr arg trp his arg ser 
301/101 

CAA CGG CGG CCG GCT GGA CAT 
gin arg arg pro ala gly his 
361/121 

CGA CCC CGA CGG CGC CGT CGG 
arg pro arg arg arg arg arg 



31/11 

GCA GCC GCA ATC AGA TCT ACC 
ala ala ala ile arg ser thr 
91/31 

GCC TGC GCG GCT CGA CAA TGC 
ala cys ala ala arg gin cys 
151/51 

GGC CAC CGA CTT CGG CTC CGC 
gly his arg leu arg leu arg 
211/71 

GCT GTG GCC GCT GGC CGA CCG 
ala val ala ala gly arg pro 
271/91 

CGT CCG GCT GGT CGA CGA CGA 
arg pro ala gly arg arg arg 

331/111 
CCT CCT GTC GGC GGC CGA GTT 
pro pro val gly gly arg val 

391/131 

CCG AGC GCT GTG CCT GGC CAT 
pro ser ala val pro gly his 



SEQ ID No. 36 C 
FIGURE 36C 



CGG TCC TGG TCA ACG TCA 
arg ser trp ser thr ser 

GCG GTT CCT GTT GCC CGT 
ala val pro val ala arg 

TGT TGC ACC AGA AAC GAC 
cys cys thr arg asn asp 

GCC CCG GTT GGC CCC CGG 
ala pro val gly pro arg 

CCT GGC AAA CTC GCT GGC 
pro gly lys leu ala gly 

CGC CAC CAA CCG GGA AGT 
arg his gin pro gly ser 

CGA CCC AGA TC 
arg pro arg 



Coding sequence Rv3909 predicted by Cole et al . , 1998 (Nature 393 537-544) 
containing Seq 36A 



1/1 

GTG ACC 
met thr 
61/21 
GTG GCC 
val ala 
121/41 
GAG CCC 
glu pro 
181/61 
ACC ACT 
thr thr 
241/81 
CCA GTC 
pro val 
301/101 









31/11 


















TGG 


GCC 


GCT 


TTG GCG 


CGC 


GTC 


ACC 


TCA 


GCG 


ATC 


GGC 


GTC 


trp 


ala 


ala 


leu ala 


arg 


val 


thr 


ser 


ala 


ile 


gly 


val 






91/31 


















CTC 


ACG 


GTA 


CCG TCG 


GCG 


GCA 


CCG 


CAC 


GCG 


CTC 


GCA 


GGC 


leu 


thr 


val 


pro ser 


ala 


ala 


pro 


his 


ala 


leu 


ala 


gly 








151/51 


















GTC 


CAG 


GTC 


CGC ATC 


GAT 


CAG 


GTG 


ACC 


CCG 


GAC 


GTG 


GTG 


val 


gin 


val 


arg ile 


asp 


gin 


val 


thr 


pro 


asp 


val 


val 








211/71 


















GTC 


ACC 


GTC 


AGC GGA 


ACG 


GTG 


ACC 


AAT 


ACC 


GGT. 


GAC 


CGC 


val 


thr 


val 


ser gly 


thr 


val 


thr 


asn 


thr 


gly 


asp 


arg 








271/91 


















CGG 


CTT 


GAG 


CAC GCC 


GCC 


GCG 


GTC 


ACG 


TCG 


TCA 


ACG 


GCG 


arg 


leu 


glu 


his ala 


ala 


ala 


val 


thr 


ser 


ser 


thr 


ala 






331/111 


















GGC 


ACC 


GAC 


CAG TAC 


CAG 


CCG 


GCC 


GCG 


GAC 


TTC 


CTC 


ACG 


gly 


thr 


asp 


gin tyr 


gin 


pro 


ala 


ala 


asp 


phe 


leu 


thr 



SEQ ID No.36D 
FIGURE 36D 



REPLACEMENT SHEET (RULE 26) 



111/185 



361/121 391/131 

GTC GCC CCC GAA CTA GAC CGC GGG CAA GAG GCC GGC TTT ACC CTC TCG GCC CCG CTG CGC 
val ala pro glu leu asp arg gly gin glu ala gly phe thr leu ser ala pro leu arg 
421/141 451/151 

TCG CTG ACC AGG CCG TCG TTG GCC GTC AAC CAG CCC GGG ATC TAC CCG GTC CTG GTC AAC 
ser leu thr arg pro ser leu ala val asn gin pro gly ile tyr pro val leu val asn 
481/161 511/171 

GTC AAT GGG ACA CCC GAC TAC GGT GCG CCT GCG CGG CTC GAC AAT GCG CGG TTC CTG TTG 
val asn gly thr pro asp tyr gly ala pro ala arg leu asp asn ala arg phe leu leu 
541/181 571/191 

CCC GTG GTC GGA GTG CCA CCC GAC CAG GCC ACC GAC TTC GGC TCC GCT GTT GCA CCA GAA 
pro val val gly val pro pro asp gin ala thr asp phe gly ser ala val ala pro glu 
601/201 631/211 

ACG ACG GCG CCG GTC TGG ATC ACC ATG CTG TGG CCG CTG GCC GAC CGG CCC CGG TTG GCC 
thr thr ala pro val trp ile thr met leu trp pro leu ala asp arg pro arg leu ala 
661/221 691/231 

CCC GGG GCA CCC GGT GGC ACC GTT CCC GTC CGG CTG GTC GAC GAC GAC CTG GCA AAC TCG 
pro gly ala pro gly gly thr val pro val arg leu val asp asp asp leu ala asn ser 
721/241 751/251 

CTG GCC AAC ,GGC GGC CGG CTG GAC ATC CTC CTG TCG GCG GCC GAG TTC GCC ACC AAC CGG 
leu ala asn gly gly arg leu asp ile leu leu ser ala ala glu phe ala thr asn arg 
781/261 811/271 

GAA GTC GAC CCC GAC GGC GCC GTC GGC CGA GCG CTG TGC CTG GCC ATC GAC CCA GAT CTA 
glu val asp pro asp gly ala val gly arg ala leu cys leu ala ile asp pro asp leu 
841/281 871/291 

CTC ATC ACC GTC AAT GCG ATG ACC GGC GGC TAC GTC GTG TCC GAC TCG CCC GAC GGG GCC 
leu ile thr val asn ala met thr gly gly tyr val val ser asp ser pro asp gly ala 
901/301 931/311 

GCT CAA CTA CCG GGC ACC CCG ACC CAC CCG GGC ACC GGC CAG GCC GCC GCA TCC AGC TGG 
ala gin leu pro gly thr pro thr his pro gly thr gly gin ala ala ala ser ser trp 
961/321 991/331 

CTG GAT CGA TTG CGG ACG CTA GTC CAC CGG ACA TGC GTG ACG CCG CTG CCT TTT GCC CAA 
leu asp arg leu arg thr leu val his arg thr cys val thr pro leu pro phe ala gin 
1021/341 1051/351 

GCC GAC CTG GAT GCT TTG CAG CGG GTT AAT GAT CCG AGG CTG AGC GCG ATC GCA ACC ATC 
ala asp leu asp ala leu gin arg val asn asp pro arg leu ser ala ile ala thr ile 
1081/361 1111/371 

AGC CCC GCC GAC ATC GTC GAC CGC ATC CTG GAT GTC AGC TCC ACC CGC GGC GCA ACC GTG 
ser pro ala asp ile val asp arg ile leu asp val ser ser thr arg gly ala thr val 
1141/381 1171/391 

CTG CCC GAC GGC CCG TTG ACC GGC CGG GCG ATC AAC TTG CTC AGC ACC CAC GGC AAC ACG 
leu pro asp gly pro leu thr gly arg ala ile asn leu leu ser thr his gly asn thr 
1201/401 1231/411 

GTT GCC GTC GCG GCC GCC GAT TTT AGC CCC GAG GAA CAG CAG GGT TCG TCC CAG ATC GGC 
val ala val ala ala ala asp phe ser pro glu glu gin gin gly ser ser gin ile gly 
1261/421 1291/431 

TCC GCG CTC TTA CCC GCT ACC GCG CCC CGG CGG TTG TCC CCG CGG GTG GTA GCG GCG CCG 
ser ala leu leu pro ala thr ala pro arg arg leu ser pro arg val val ala ala pro 
1321/441 1351/451 

TTT GAT CCC GCG GTC GGG GCC GCG CTG GCC GCC GCG GGA ACA AAC CCG ACC GTT CCT ACC 
phe asp pro ala val gly ala ala leu ala ala ala gly thr asn pro thr val pro thr 
1381/461 1411/471 

TAT CTA GAT CCC TCG TTG TTC GTT CGG ATC GCG CAT GAA TCG ATC ACC GCG CGC CGC CAG 
tyr leu asp pro ser leu phe val arg ile ala his glu ser ile thr ala arg arg gin 



SEQ ID No. 3 6D (continued 1) 
FIGURE 36D (continued 1) 
REPLACEMENT SHEET (RULE 26) 



112/185 



1441/481 1471/491 

GAC GCC TTG GGC GCA ATG CTG TGG CGC AGC TTG GAG CCG AAT GCC GCG CCC CGT ACC CAA 
asp ala leu gly ala met leu trp arg ser leu glu pro asn ala ala pro arg thr gin 
1501/501 1531/511 

ATC CTG GTG CCG CCG GCG TCG TGG AGC CTG GCC AGC GAC GAC GCG CAG GTC ATC CTG ACC 
ile leu val pro pro ala ser trp ser leu ala ser asp asp ala gin val ile leu thr 
1561/521 1591/531 

GCG CTG GCC ACC GCC ATC CGG TCT GGC CTG GCC GTG CCG CGA CCA CTA CCG GCG GTG ATC 
ala leu ala thr ala ile arg ser gly leu ala val pro arg pro leu pro ala val ile 
1621/541 1651/551 

GCT GAC GCC GCG GCC CGC ACC GAG CCA CCG GAA CCC CCG GGC GCT TAC AGC GCC GCT CGC 
ala asp ala ala ala arg thr glu pro pro glu pro pro gly ala tyr ser ala ala arg 
1681/561 1711/571 

GGC CGG TTC AAT GAC GAC ATC ACC ACG CAG ATC GGC GGG CAG GTT GCC CGG CTA TGG AAG 
qlv arg phe asn asp asp ile thr thr gin ile gly gly gin val ala arg leu trp lys 
1741/581 1771/591 

CTG ACC TCG GCG TTG ACC ATC GAT GAC CGC ACC GGG CTG ACC GGC GTG CAG TAC ACC GCA 
leu thr ser ala leu thr ile asp asp arg thr gly leu thr gly val gin tyr thr ala 
1801/601 1831/611 

CCA CTA CGC GAG GAC ATG TTG CGC GCG CTG AGC CAA TCG CTA CCA CCC GAT ACC CGC AAC 
pro leu arg glu asp met leu arg ala leu ser gin ser leu pro pro asp thr arg asn 
1861/621 1891/631 

GGG CTG GCC CAG CAG CGG CTG GCC GTC GTT GGA AAG ACG ATC GAC GAT CTT TTC GGC GCG 
gly leu ala gin gin arg leu ala val val gly lys thr ile asp asp leu phe gly ala 
1921/641 1951/651 

GTG ACC ATC GTC AAC CCG GGC GGC TCC TAC ACT CTG GCC ACC GAG CAC AGT CCG CTG CCG 
val thr ile val asn pro gly gly ser tyr thr leu ala thr glu his ser pro leu pro 
1981/661 • 2011/671 

TTG GCG CTG CAT AAT GGC CTC GCC GTG CCA ATC CGG GTC CGG CTA CAG GTC GAT GCT CCG 
leu ala leu his asn gly leu ala val pro ile arg val arg leu gin val asp ala pro 
2041/681 2071/691 

CCC GGG ATG ACG GTG GCC GAT GTC GGT CAG ATC GAG CTA CCG CCC GGG TAC CTG CCG CTA 
pro gly met thr val ala asp val gly gin ile glu leu pro pro gly tyr leu pro leu 
2101/701 2131/711 

CGA GTA CCA ATC GAG GTG AAC TTC ACA CAG CGG GTT GCC GTC GAC GTG TCG CTG CGG ACC 
arg val pro ile glu val asn phe thr gin arg val ala val asp val ser leu arg thr 
2161/721 2191/731 

CCC GAC GGC GTC GCG CTG GGT GAA CCG GTG CGG TTG TCG GTG CAC TCC AAC GCC TAC GGC 
pro asp gly val^ ala leu gly glu pro val arg leu ser val his ser asn ala tyr gly 
2221/741 2251/751 

AAG GTG TTG TTC GCG ATC ACG CTA TCC GCT GCG GCC GTG CTG GTA ACG CTG GCG GGC CGG 
lys val leu phe ala ile thr leu ser ala ala ala val leu val thr leu ala gly arg 
2281/761 2311/771 

CGC CTT TGG CAC CGG TTC CGT GGC CAG CCT GAT CGC GCC GAC CTG GAT CGC\ CCC GAC CTG 
arg leu trp his arg phe arg gly gin pro asp arg ala asp leu asp arg pro asp leu 
2341/781 2371/791 

CCT ACC GGC AAA CAC GCC CCG CAG CGC CGT GCC GTA GCC AGT CGG GAT GAC GAA AAG CAC 
pro thr gly lys his ala pro gin arg arg ala val ala ser arg asp asp glu lys his 
2401/801 
CGG GTA TGA 
arg val OPA 



SEQ ID No.36D (continued 2) 
FIGURE 36D (continued 2) 



REPLACEMENT SHEET (RULE 26) 




113/185 



ORF according to Cole et al., 1998 (Nature 393 537-544) and containing Rv 3909. 
1/1 31/11 

TGA CTC AGC ACC GGG TCA GCA CAA CGG TCC CGG GCC GGG GCC GTG ACC GCA CTG CAA CTC 
OPA leu ser thr gly ser ala gin arg ser arg ala gly ala val thr ala leu gin leu 
61/21 * 91/31 

GGC TGG GCC GCT TTG GCG CGC GTC ACC TCA GCG ATC GGC GTC GTG GCC GGC CTC GGG ATG 
gly trp ala ala leu ala arg val thr ser ala ile gly val val ala gly leu gly met 
121/41 151/51 

GCG CTC ACG GTA CCG TCG GCG GCA CCG CAC GCG CTC GCA GGC GAG CCC AGC CCG ACG CCT 
ala leu thr val pro ser ala ala pro his ala leu ala gly glu pro ser pro thr pro 
181/61 211/71 

TTT GTC CAG GTC CGC ATC GAT CAG GTG ACC CCG GAC GTG GTG ACC ACT TCC AGC GAA CCC 
phe val gin val arg ile asp gin val thr pro asp val val thr thr ser ser glu pro 
241/81 271/91 

CAT GTC ACC GTC AGC GGA ACG GTG ACC AAT ACC GGT GAC CGC CCA GTC CGC GAT GTG ATG 
his val thr val ser gly thr val thr asn thr gly asp arg pro val arg asp val met 
301/101 331/111 

GTC CGG CTT GAG CAC GCC GCC GCG GTC ACG TCG TCA ACG GCG TTA CGC ACC TCG CTC GAC 
val arg leu glu his ala ala ala val thr ser ser thr ala leu arg thr ser leu asp 
361/121 391/131 

GGC GGC ACC GAC CAG TAC CAG CCG GCC GCG GAC TTC CTC ACG GTC GCC CCC GAA CTA GAC 
gly gly thr asp gin tyr gin pro ala ala asp phe leu thr val ala pro glu leu asp 
421/141 451/151 

CGC GGG CAA GAG GCC GGC TTT ACC CTC TCG GCC CCG CTG CGC TCG CTG ACC AGG CCG TCG 
arg gly gin glu ala gly phe thr leu ser ala pro leu arg ser leu thr arg pro ser 
481/161 511/171 

TTG GCC GTC AAC CAG CCC GGG ATC TAC CCG GTC CTG GTC AAC GTC AAT GGG ACA CCC GAC 
leu ala val asn gin pro gly ile tyr pro val leu val asn val asn gly thr pro asp 
541/181 571/191 

TAC GGT GCG CCT GCG CGG CTC GAC AAT GCG CGG TTC CTG TTG CCC GTG GTC GGA GTG CCA 
tyr gly ala pro ala arg leu asp asn ala arg phe leu leu pro val val gly val pro 
601/201 631/211 

CCC GAC CAG GCC ACC GAC TTC GGC TCC GCT GTT GCA CCA GAA ACG ACG GCG CCG GTC TGG 
pro asp gin ala thr asp phe gly ser ala val ala pro glu thr thr ala pro val trp 
661/221 691/231 

ATC ACC ATG CTG TGG CCG CTG GCC GAC CGG CCC CGG TTG GCC CCC GGG GCA CCC GGT GGC 
ile thr met leu trp pro leu ala asp arg pro arg leu ala pro gly ala pro gly gly 
721/241 751/251 

ACC GTT CCC GTC CGG CTG GTC GAC GAC GAC CTG GCA AAC TCG CTG GCC AAC GGC GGC CGG 
thr val pro val arg leu val asp asp asp leu ala asn ser leu ala asn gly gly arg 
781/26L 811/271 

CTG GAC ATC CTC CTG TCG GCG GCC GAG TTC GCC ACC AAC CGG GAA GTC GAC CCC GAC GGC 
leu asp ile leu leu ser ala ala glu phe ala thr asn arg glu val asp pro asp gly 
841/281 871/291 

GCC GTC GGC CGA GCG CTG TGC CTG GCC ATC GAC CCA GAT CTA CTC ATC ACC GTC AAT GCG 
ala val gly arg ala leu cys leu ala ile asp pro asp leu leu ile thr val asn ala 
901/301 931/311 

ATG ACC GGC GGC TAC GTC GTG TCC GAC TCG CCC GAC GGG GCC GCT CAA CTA CCG GGC ACC 
met thr gly gly tyr val val ser asp ser pro asp gly ala ala gin leu pro gly thr 
961/321 991/331 

CCG ACC CAC CCG GGC ACC GGC CAG GCC GCC GCA TCC AGC TGG CTG GAT CGA TTG CGG ACG 
pro thr his pro gly thr gly gin ala ala ala ser ser trp leu asp arg leu arg thr 



SEQ ID No.36F 

FIGURE 36F 
REPLACEMENT SHEET (RULE 26) 



114/185 



1021/341 

CTA GTC CAC CGG ACA TGC GTG ACG CCG CTG 
leu val his arg thr cys val thr pro leu 
1081/361 

CAG CGG GTT AAT GAT CCG AGG CTG AGC GCG 
gin arg val asn asp pro arg leu ser ala 
1141/381 

GAC CGC ATC CTG GAT GTC AGC TCC ACC CGC 
asp arg ile leu asp val ser ser thr arg 
1201/401 

ACC GGC CGG GCG ATC AAC TTG CTC AGC ACC 
thr gly arg ala ile asn leu leu ser thr 
1261/421 

GAT TTT AGC CCC GAG GAA CAG CAG GGT TCG 
asp phe ser pro glu glu gin gin gly ser 
1321/441 

ACC GCG CCC CGG CGG TTG TCC CCG CGG GTG 
thr ala pro arg arg leu ser pro arg val 
1381/461 

GCC GCG CTG GCC GCC GCG GGA ACA AAC CCG 
ala ala leu ala ala ala gly thr asn pro 
1441/481 

TTC GTT CGG ATC GCG CAT GAA TCG ATC ACC 
phe val arg ile ala his glu ser ile thr 
1501/501 

CTG TGG CGC AGC TTG GAG CCG AAT GCC GCG 
leu trp arg ser leu glu pro asn ala ala 
1561/521 

TCG TGG AGC CTG GCC AGC GAC GAC GCG CAG 
ser trp ser leu ala ser asp asp ala gin 
1621/541 

CGG TCT GGC CTG GCC GTG CCG CGA CCA CTA 
arg ser gly leu ala val pro arg pro leu 
1681/561 

ACC GAG CCA CCG GAA CCC CCG GGC GCT TAC 
thr glu pro pro glu pro pro gly ala tyr 
1741/581 

ATC ACC ACG CAG ATC GGC GGG CAG GTT GCC 
ile thr thr gin ile gly gly gin val ala 
1801/601 

ATC GAT GAC CGC ACC GGG CTG ACC GGC GTG 
ile asp asp arg thr gly leu thr gly val 
1861/621 

TTG CGC GCG CTG AGC CAA TCG CTA CCA CCC 
leu arg ala leu ser gin ser leu pro pro 
1921/641 

CTG GCC GTC GTT GGA AAG ACG ATC GAC GAT 
leu ala val val gly lys thr ile asp asp 
1981/661 

GGC GGC TCC TAC ACT CTG GCC ACC GAG CAC 
gly gly ser tyr thr leu ala thr glu his 



SEQ ID 36F 
FIGURE 36F 



1051/351 

CCT TTT GCC CAA GCC GAC CTG GAT GCT TTG 
pro phe ala gin ala asp leu asp ala leu 
1111/371 

ATC GCA ACC ATC AGC CCC GCC GAC ATC GTC 
ile ala thr ile ser pro ala asp ile val 
1171/391 

GGC GCA ACC GTG CTG CCC GAC GGC CCG TTG 
gly ala thr val leu pro asp gly pro leu 
1231/411 

CAC GGC AAC ACG GTT GCC GTC GCG GCC GCC 
his gly asn thr val ala val ala ala ala 
1291/431 

TCC CAG ATC GGC TCC GCG .CTC TTA CCC GCT 
ser gin ile gly ser ala leu leu pro ala 
1351/451 

GTA GCG GCG CCG TTT GAT CCC GCG GTC GGG 
val ala ala pro phe asp pro ala val gly 
1411/471 

ACC GTT CCT ACC TAT CTA GAT CCC TCG TTG 
thr val pro thr tyr leu asp pro ser leu 
1471/491 

GCG CGC CGC CAG GAC GCC TTG GGC GCA ATG 
ala arg arg gin asp ala leu gly ala met 
1531/511 

CCC CGT ACC CAA ATC CTG GTG CCG CCG GCG 
pro arg thr gin ile leu val pro pro ala 
1591/531 

GTC ATC CTG ACC GCG CTG GCC ACC GCC ATC 
val ile leu thr ala leu ala thr ala ile 
1651/551 

CCG GCG GTG ATC GCT GAC GCC GCG GCC CGC 
pro ala val ile ala asp ala ala ala arg 
1711/571 

AGC GCC GCT CGC GGC CGG TTC AAT GAC GAC 
ser ala ala arg gly arg phe asn asp asp 
1771/591 

CGG CTA TGG AAG CTG ACC TCG GCG TTG ACC 
arg leu trp lys leu thr ser ala leu thr 
1831/611 

CAG TAC ACC GCA CCA CTA CGC GAG GAC ATG 
gin tyr thr ala pro leu arg glu asp met 
1891/631 

GAT ACC CGC AAC GGG CTG GCC CAG CAG CGG 
asp thr arg asn gly leu ala gin gin arg 
1951/651 

CTT TTC GGC GCG GTG ACC ATC GTC AAC CCG 
leu phe gly ala val thr ile val asn pro 
2011/671 

AGT CCG CTG CCG TTG GCG CTG CAT AAT GGC 
ser pro leu pro leu ala leu his asn gly 



(continued 1) 
(continued 1) 



REPLACEMENT SHEET (RULE 26) 



• 



115/185 



2071/691 

CGG GTC CGG CTA CAG GTC GAT GCT CCG CCC GGG ATG ACG GTG GCC 
arg val arg leu gin val asp ala pro pro gly met thr val ala 

2131/711 

GAG CTA CCG CCC GGG TAC CTG CCG CTA CGA GTA CCA ATC GAG GTG 
glu leu pro pro gly tyr leu pro leu arg val pro ile glu val 

2191/731 

GTT GCC GTC GAC GTG TCG CTG CGG ACC CCC GAC GGC GTC GCG CTG 
val ala val asp val ser leu arg thr pro asp gly val ala leu 

2251/751 

TTG TCG GTG CAC TCC AAC GCC TAC GGC AAG GTG TTG TTC GCG ATC 
leu ser val his ser asn ala tyr gly lys val leu phe ala ile 

2311/771 

GCC GTG CTG GTA ACG CTG GCG GGC CGG CGC CTT TGG CAC CGG TTC 
ala val leu val thr leu ala gly arg arg leu trp his arg phe 

2371/791 

CGC GCC GAC CTG GAT CGC CCC GAC CTG CCT ACC GGC AAA CAC GCC 
arg ala asp leu asp arg pro asp leu pro thr gly lys his ala 

2431/811 

GTA GCC AGT CGG GAT GAC GAA AAG CAC CGG GTA TGA 
val ala ser arg asp asp glu lys his arg val OPA 



SEQ ID 36F (continued 2) 
FIGURE 36F (continued 2) 



1/1 


















31/11 


















ATC CGC 


GCG 


TTG 


GCG 


TCG 


CAT 


CCG 


AAC 


ATC 


GTC GGA 


GTC 


AAG 


GAC 


GCC 


AAA 


GCC 


GAC 


CTG 


ile arg 


ala 


leu 


ala 


ser 


his 


pro 


asn 


ile 


val gly 


val 


lys 


asp 


ala 


lys 


ala 


asp 


leu 


61/21 


















91/31 














GAC 


GCG 


CAC AGC 


GGC 


GCC 


CAA 


ATC 


ATG 


GCC 


GAC 


ACC 


GGA CTG 


GCC 


TAC 


TAT 


TCC 


GGC 


GAC 


his ser 


gly 


ala 


gin 


ile 


met 


ala 


asp 


thr 


gly leu 


ala 


tyr 


tyr 


ser 


gly 


asp 


asp 


ala 


121/41 














151/51 














CAC 


CTG 


CTC AAC 


CTG 


CCC 


TGG 


CTG 


GCC 


ATG 


GGC 


GCC 


ACG GGC 


TTC 


ATC 


AGC 


GTG 


ATT 


GCC 


leu asn 


leu 


pro 


trp 


leu 


ala 


met 


gly 


ala 


thr gly 


phe 


ile 


ser 


val 


ile 


ala 


his 


leu 


181/61 
















211/71 














ACC 


GCC 


GCA GCC 


GGG 


CAG 


CTT 


CGA 


GAG 


TTG 


TTG 


TCC 


GCC TTC 


GGT 


TCT 


GGG 


GAT 


ATC 


GCC 


ala ala 


gly 


gin 


leu 


arg 


glu 


leu 


leu 


ser 


ala phe 


gly 


ser 


gly 


asp 


ile 


ala 


' thr 


ala 



241/81 

CGC AAG ATC 

arg lys ile 

SEQ ID No.37A 
FIGURE 37A 



2041/681 

CTC GCC GTG CCA ATC 
leu ala val pro ile 
2101/701 

GAT GTC GGT CAG ATC 
asp val gly gin ile 
2161/721 

AAC TTC ACA CAG CGG 
asn phe thr gin arg 
2221/741 

GGT GAA CCG GTG CGG 
gly glu pro val arg 
2281/761 

ACG CTA TCC GCT GCG 
thr leu ser ala ala 
2341/781 

CGT GGC CAG CCT GAT 
arg gly gin pro asp 
2401/801 

CCG CAG CGC CGT GCC 
pro gin arg arg ala 



REPLACEMENT SHEET (RULE 26) 



116/185 



!/l 31/11 x 

GAT CCG CGC GTT GGC GTC GCA TCC GAA CAT CGT CGG AGT CAA GGA CGC CAA AGC CGA CCT 
asp pro arg val gly val ala ser glu his arg arg ser gin gly arg gin ser arg pro 
61/21 91/31 

GCA CAG CGG CGC CCA AAT CAT GGC CGA CAC CGG ACT GGC CTA CTA TTC CGG CGA CGA CGC 
ala gin arg arg pro asn his gly arg his arg thr gly leu leu phe arg arg arg arg 
121/41 151/51 

GCT CAA CCT GCC CTG GCT GGC CAT GGG CGC CAC GGG CTT CAT CAG CGT GAT TGC CCA CCT 
ala gin pro ala leu ala gly his gly arg his gly leu his gin arg asp cys pro pro 
181/61 211/71 

GGC AGC CGG GCA GCT TCG AGA GTT GTT GTC CGC CTT CGG TTC TGG GGA TAT CGC CAC CGC 
gly ser arg ala ala ser arg val val val arg leu arg phe trp gly tyr arg his arg 
241/81 

CCG CAA GAT C 
pro gin asp 



SEQ ID No.37B 
FIGURE 37B 



1/1 

TCC GCG CGT TGG CGT CGC ATC 
ser ala arg trp arg arg ile 
61/21 

ACA GCG GCG CCC AAA TCA TGG 
thr ala ala pro lys ser trp 
121/41 

TCA ACC TGC CCT GGC TGG CCA 
ser thr cys pro gly trp pro 
181/63. 

CAG CCG GGC AGC TTC GAG AGT 
gin pro gly ser phe glu ser 
241/81 
GCA AGA TC 
ala arg 



31/11 



CGA 


ACA 


TCG 


TCG GAG 


TCA 


AGG 


arg 


thr 


ser 


ser glu 


ser 


arg 








91/31 






CCG 


ACA 


CCG 


GAC TGG 


CCT 


ACT 


pro 


thr 


pro 


asp trp 


pro 


thr 








151/51 






TGG 


GCG 


CCA 


CGG GCT 


TCA 


TCA 


trp 


ala 


pro 


arg ala 


ser 


ser 








211/71 






TGT 


TGT 


CCG 


CCT TCG 


GTT 


CTG 


cys 


cys 


pro 


pro ser 


val 


leu 



SEQ ID No.37C 
FIGURE 37C 



ACG CCA AAG CCG ACC TGC 
thr pro lys pro thr cys 

ATT CCG GCG ACG ACG CGC 
ile pro ala thr thr arg 

GCG TGA TTG CCC ACC TGG 
ala OPA leu pro thr trp 

GGG ATA TCG CCA CCG CCC 
gly ile ser pro pro pro 



REPLACEMENT SHEET (RULE 26) 



117/185 



Coding sequence Rv2753c predicted by Cole et al . f 1998 (Nature 393 537-544) 
containing Seq 37A 

1/1 31/11 

GTG ACC ACC GTC GGA TTC GAC GTC GCA GCG CGC CTA GGA ACC CTG CTG ACC GCG ATG GTG 
val thr thr val gly phe asp val ala ala arg leu gly thr leu leu thr ala met val 
61/21 91/31 

ACA CCG TTT AGC GGC GAT GGC TCC CTG GAC ACC GCC ACC GCG GCG CGG CTG GCC AAC CAC 
thr pro phe ser gly asp gly ser leu asp thr ala thr ala ala arg leu ala asn his 
121/41 " 151/51 

CTG GTC GAT CAG GGG TGC GAC GGT CTG GTG GTC TCG GGC ACC ACC GGC GAG TCG CCG ACC 
leu val asp gin gly cys asp gly leu val val ser gly thr thr gly glu ser pro thr 
181/61 211/71 

ACC ACC GAC GGG GAG AAA ATC GAG CTG CTG CGG GCC GTC TTG GAA GCG GTG GGG GAC CGG 
thr thr asp gly glu lys ile glu leu leu arg ala val leu glu ala val gly asp arg 
241/81 271/91 

GCC CGT GTT ATC GCC GGT GCC GGC ACC TAT GAC ACC GCG CAC AGC ATC CGG CTG GCC AAG 
ala arg val ile ala gly ala gly thr tyr asp thr ala his ser ile arg leu ala lys 
301/101 331/111 

GCT TGT GCG GCC GAG GGT GCG CAC GGG CTG CTG GTG GTC ACG CCC TAC TAT TCC AAG CCG 
ala cys ala ala glu gly ala his gly leu leu val val thr pro tyr tyr ser lys pro 
361/121 391/131 

CCG CAG CGG GGG CTG CAA GCC CAT TTC ACC GCC GTC GCC GAC GCG ACC GAG CTG CCG ATG 
pro gin arg gly leu gin ala his phe thr ala val ala asp ala thr glu leu pro met 
421/141 451/151 

CTG CTC TAT GAC ATC CCG GGG CGG TCG GCG GTG CCG ATC GAG CCC GAC ACG ATC CGC GCG 
leu leu tyr asp ile pro gly arg ser ala val pro ile glu pro asp thr ile arg ala 
481/161 511/171 

TTG GCG TCG CAT CCG AAC ATC GTC GGA GTC AAG GAC GCC AAA GCC GAC CTG CAC AGC GGC 
leu ala ser his pro asn ile val gly val lys asp ala lys ala asp leu his ser gly 
541/181 571/191 

GCC CAA ATC ATG GCC GAC ACC GGA CTG GCC TAC TAT TCC GGC GAC GAC GCG CTC AAC CTG 
ala gin ile met ala asp thr gly leu ala tyr tyr ser gly asp asp ala leu asn leu 
601/201 631/211 

CCC TGG CTG GCC ATG GGC GCC ACG GGC TTC ATC AGC GTG ATT GCC CAC CTG GCA GCC GGG 
pro trp leu ala met gly ala thr gly phe ile ser val ile ala his leu ala ala gly 
661/221 691/231 

CAG CTT CGA GAG TTG TTG TCC GCC TTC GGT TCT GGG GAT ATC GCC ACC GCC CGC AAG ATC 
gin leu arg glu leu leu ser ala phe gly ser gly asp ile ala thr ala arg lys ile 
721/241 751/251 

AAC ATT GCG GTC GCC CCG CTG TGC AAC GCG ATG AGC CGC CTG GGT GGG GTG ACG TTG TCC 
asn ile ala val ala pro leu cys asn ala met ser arg leu gly gly val thr leu ser 
781/261 811/271 

AAG GCG GGC TTG CGG CTG CAG GGC ATC GAC GTC GGT GAT CCC CGG CTG CCC CAG GTG GCC 
lys ala gly leu arg leu gin gly ile asp val gly asp pro arg leu pro gin val ala 
841/281 871/291 

GCG ACA CCG GAG CAG ATC GAC GCG TTG GCC GCC GAC ATG CGC GCG GCC TCG GTG CTT CGG 
ala thr pro glu gin ile asp ala leu ala ala asp met arg ala ala ser val leu arg 

901/301 

TGA 

OPA 



SEQ ID No.37D 
FIGURE 37D 



REPLACEMENT SHEET (RULE 26) 




118/185 



ORF according to Cole et al . , 1998 (Nature 393 537-544) containing Rv2753c 
1/1 31/11 

TAA GGT GAG CGC CGT GGC CGA GAC CGC GCC GCT GCG CGT GCA ACT GAT CGC CAA GAC CGA 
OCH gly glu arg arg gly arg asp arg ala ala ala arg ala thr asp arg gin asp arg 
61/21 91/31 

CTT CTT GGC CCC ACC CGA CGT GCC CTG GAC CAC CGA CGC CGA CGG CGG ACC CGC GCT GGT 
leu leu gly pro thr arg arg ala leu asp his arg arg arg arg arg thr arg ala gly 
121/41 151/51 

CGA GTT CGC CGG CCG GGC CTG CTA TCA GAG CTG GTC CAA GCC CAA TCC CAA GAC CGC CAC 
arg val arg arg pro gly leu leu ser glu leu val gin ala gin ser gin asp arg his 
181/61 211/71 

CAA CGC CGG CTA CCT CCG GCA CAT CAT CGA CGT CGG ACA TTT CTC GGT GCT AGA GCA TGC 
gin arg arg leu pro pro ala his his arg arg arg thr phe leu gly ala arg ala cys 
241/81 271/91 

CAG CGT GTC GTT CTA CAT CAC CGG GAT CTC GCG ATC GTG CAC CCA CGA GCT GAT CCG CCA 
gin arg val val leu his his arg asp leu ala ile val his pro arg ala asp pro pro 
301/101 331/111 

CCG GCA TTT CTC CTA CTC GCA GCT CTC CCA GCG CTA CGT ACC CGA GAA GGA CTC GCG GGT 
pro ala phe leu leu leu ala ala leu pro ala leu arg thr arg glu gly leu ala gly 
361/121 391/131 

CGT CGT GCC GCC CGG CAT GGA GGA CGA CGC CGA= CCT GCG CCA CAT CCT GAC CGA GGC CGC 
arg arg ala ala arg his gly gly arg arg arg pro ala pro his pro asp arg gly arg 
421/141 451/151 

CGA CGC CGC CCG CGC CAC CTA CAG CGA GCT GCT GGC CAA GCT GGA AGC CAA GTT CGC CGA 
arg arg arg pro arg his leu gin arg ala ala gly gin ala gly ser gin val arg arg 
481/161 511/171 

CCA ACC CAA CGC GAT CCT GCG CCG CAA GCA GGC CCG CCA AGC CGC CCG CGC GGT GCT GCC 
pro thr gin arg asp pro ala pro gin ala gly pro pro ser arg pro arg gly ala ala 
541/181 571/191 

CAA CGC CAC CGA AAC CCG CAT CGT GGT GAC CGG CAA CTA CCG GGC CTG GCG GCA CTT CAT 
gin arg his arg asn pro his arg gly asp arg gin leu pro gly leu ala ala leu his 
601/201 631/211 

CGC AAT GCG GGC CAG CGA GCA CGC CGA CGT GGA AAT CCG GCG ACT GGC CAT CGA ATG CCT 
arg asn ala gly gin arg ala arg arg arg gly asn pro ala thr gly his arg met pro 
661/221 . 691/231 

GCG CCA GCT CGC CGC CGT GGC CCC CGC GGT GTT CGC CGA CTT CGA GGT GAC CAC CCT GGC 
ala pro ala arg arg arg gly pro arg gly val arg arg leu arg gly asp his pro gly 
721/241 751/251 

CGA CGG CAC CGA GGT GGC GAC CAG CCC GTT GGC GAC CGA AGC CTG AGG CGG CGT GTC GCT 
arg arg his arg gly gly asp gin pro val gly asp arg ser leu arg arg arg val ala 
781/261 811/271 

GGA CAA ACA CGC GCG CTC GCG GCC GGG ATA AAG CGC CAG GTA ACC TTG GGA GCC GTG ACC 
gly gin thr arg ala leu ala ala gly ile lys arg gin val thr leu gly ala val thr 
841/281 871/291 

ACC GTC GGA TTC GAC GTC GCA GCG CGC CTA GGA ACC CTG CTG ACC GCG ATG GTG ACA CCG 
thr val gly phe asp val ala ala arg leu gly thr leu leu thr ala met val thr pro 
901/301 931/311 

TTT AGC GGC GAT GGC TCC CTG GAC ACC GCC ACC GCG GCG CGG CTG GCC AAC CAC CTG GTC 
phe ser gly asp gly ser leu asp thr ala thr ala ala arg leu ala asn his leu val 
961/321 991/331 

GAT CAG GGG TGC GAC GGT CTG GTG GTC TCG GGC ACC ACC GGC GAG TCG CCG ACC ACC ACC 
asp gin gly cys asp gly leu val val ser gly thr thr gly glu ser pro thr thr thr 



SEQ ID No.37F 
FIGURE 37F 



REPLACEMENT SHEET (RULE 26) 



• 

119/185 



1021/341 
















1051/351 
















GAC GGG GAG 


AAA 


ATC 


GAG 


CTG 


CTG 


CGG 


GCC 


GTC TTG GAA 


GCG 


GTG 


GGG 


GAC 


CGG 


GCC 


CGT 


asp gly glu 


lys 


ile 


glu 


leu 


leu 


arg 


ala 


val leu glu 


ala 


val 


gly 


asp 


arg 


ala 


arg 


1081/361 














1111/371 
















GTT ATC GCC 


GGT 


GCC 


GGC 


ACC 


TAT 


GAC 


ACC 


GCG CAC AGC 


ATC 


CGG 


CTG 


GCC 


AAG 


GCT 


TGT 


val ile ala 


gly 


ala 


gly 


thr 


tyr 


asp 


thr 


ala his ser 


ile 


arg 


leu 


ala 


lys 


ala 


cys 


1141/381 














1171/391 
















GCG GCC GAG 


GGT 


GCG 


CAC 


GGG 


CTG 


CTG 


GTG 


GTC ACG CCC 


TAC 


TAT 


TCC 


AAG 


CCG 


CCG 


CAG 


ala ala glu 


gly 


ala 


his 


gly 


leu 


leu 


val 


val thr pro 


tyr 


tyr 


ser 


lys 


pro 


pro 


gin 


1201/401 














1231/411 
















CGG GGG CTG 


CAA 


GCC 


CAT 


TTC 


ACC 


GCC 


GTC 


GCC GAC GCG 


ACC 


GAG 


CTG 


CCG 


ATG 


CTG 


CTC 


arg gly leu 


gin 


ala 


his 


phe 


thr 


ala 


val 


ala asp ala 


thr 


glu 


leu 


pro 


met 


leu 


leu 


1261/421 














1291/431 
















TAT GAC ATC 


CCG 


GGG 


CGG 


TCG 


GCG 


GTG 


CCG 


ATC GAG CCC 


GAC 


ACG 


ATC 


CGC 


GCG 


TTG 


GCG 


tyr asp ile 


pro 


gly 


arg 


ser 


ala 


val 


pro 


ile glu pro 


asp 


thr 


ile 


arg 


ala 


leu 


ala 


1321/441 












1351/451 
















TCG CAT CCG 


AAC 


ATC 


GTC 


GGA 


GTC 


AAG 


GAC 


GCC AAA GCC 


GAC 


CTG 


CAC 


AGC 


GGC 


GCC 


CAA 


ser his pro 


asn 


ile 


val 


gly 


val 


lys 


asp 


ala lys ala 


asp 


leu 


his 


ser 


gly 


ala 


gin 


1381/461 
















1411/471 
















ATC ATG GCC 


GAC 


ACC 


GGA 


CTG 


GCC 


TAC 


TAT 


TCC GGC GAC 


GAC 


GCG 


CTC 


AAC 


CTG 


CCC 


TGG 


ile met ala 


asp 


thr 


gly 


leu 


ala 


tyr 


tyr 


ser gly asp 


asp 


ala 


leu 


asn 


leu 


pro 


trp 


1441/481 














1471/491 
















CTG GCC ATG 


GGC 


GCC 


ACG 


GGC 


TTC 


ATC 


AGC 


GTG ATT GCC 


CAC 


CTG 


GCA 


GCC 


GGG 


CAG 


CTT 


leu ala met 


gly 


ala 


thr 


gly 


phe 


ile 


ser 


val ile ala 


his 


leu 


ala 


ala 


gly 


gin 


leu 


1501/501 














1531/ 511 
















CGA GAG TTG 


TTG 


TCC 


GCC 


TTC 


GGT 


TCT 


GGG 


GAT ATC GCC 


ACC 


GCC 


CGC 


AAG 


ATC 


AAC 


ATT 


arg glu leu 


leu 


ser 


ala 


phe 


gly 


ser 


gly 


asp ile ala 


thr 


ala 


arg 


lys 


ile 


asn 


ile 


1561/521 
















1591/531 
















GCG GTC GCC 


CCG 


CTG 


TGC 


AAC 


GCG 


ATG 


AGC 


CGC CTG GGT 


GGG 


GTG 


ACG 


TTG 


TCC 


AAG 


GCG 


ala val ala 


pro 


leu 


cys 


asn 


ala 


met 


ser 


arg leu gly 


gly 


val 


thr 


leu 


ser 


lys 


ala 


1621/541 














1651/551 
















GGC TTG CGG 


CTG 


CAG 


GGC 


ATC 


GAC 


GTC 


GGT 


GAT CCC CGG 


CTG 


CCC 


CAG 


GTG 


GCC 


GCG 


ACA 


gly leu arg 


leu 


gin 


gly 


ile 


asp 


val 


gly 


asp pro arg 


leu 


pro 


gin 


val 


ala 


ala 


thr 


1681/561 
















1711/571 
















CCG GAG CAG 


ATC 


GAC 


GCG 


TTG 


GCC 


GCC 


GAC 


ATG CGC GCG 


GCC 


TCG 


GTG 


CTT 


CGG 


TGA 




pro glu gin 


ile 


asp 


ala 


leu 


ala 


ala 


asp 


met arg ala 


ala 


ser 


val 


leu 


arg 


OPA 





SEQ ID No.37F (continued 1) 
FIGURE 37F (continued 1) 



REPLACEMENT SHEET (RULE 26) 



120/185 



l/l 

GCG GTG AAC TGG TGG GCC CGG ATG GTT CAA 
ala val asn trp trp ala arg met val gin 
61/21 

AGA CGA CGG ATG GAA GGA GAT GCT GGC GCC 
arg arg arg met glu gly asp ala gly ala 
121/41 

TCG TCG TCT ACG GAG GTG AAG GCG GCG GAT 
ser ser ser thr glu val lys ala ala asp 
181/61 

ACT GGC CCG GAG GTG AAG GCG GCG GAT TCG 
thr gly pro gin val lys ala ala asp ser 
241/81 

GAC GCG TGC CCA GAA CAG GCC CTC GTC GAG 
asp ala cys pro glu gin ala leu val glu 
301/101 

CTT GTT GGC ATT GCG GCG ACG CTG CTC GCG 
leu val gly ile ala ala thr leu leu ala 
361/121 

TTT GCG TTG CGC TCA CAC CAG GAA AGC CAA 
phe ala leu arg ser his gin glu ser gin 
421/141 

GAG GCC GCT AAG GAT TGC GTT GCG GCC ACG 
glu ala ala lys asp cys val ala ala thr 
481/161 

AGC ATG CAG AAG ATC 
ser met gin lys ile 



31/11 

GTA CGC CGT CGC AAA CTC GAG CAC AAC AGG 
val arg arg arg lys leu glu his asn arg 
91/31 

GGC CAG CTG AAC CCT GCC GAT GCG AAT AAG 
gly gin leu asn pro ala asp ala asn lys 
151/51 

TCG GCG GAA TCT GAC GCC GGA GCC GAC CAG 
ser ala glu ser asp ala gly ala asp gin 
211/71 

GCG GAA TCT GAC GCC GGA GAG CTC GGC GAG 
ala glu ser asp ala gly glu leu gly glu 
271/91 

CGG CGC CCG TCG CGG TTG CGG CGA GGC TGG 
arg arg pro ser arg leu arg arg gly trp 
331/111 

TTG GCC GGT GGC CTT GGC GCA GCG GGT TAT 
leu ala gly gly leu gly ala ala gly tyr 
391/131 

TCA ATC GCG CGC GAG GAC CTT GCG GCC ATT 
ser ile ala arg glu asp leu ala ala ile 
451/151 

CAG GCA CCC GAT GCT GGG GCG ATG TCG GCT 
gin ala pro asp ala gly ala met ser ala 



ID No.38A 



FIGURE 38A 



l/l 

CAG CGG TGA ACT GGT GGG CCC GGA TGG TTC 
gin arg OPA thr gly gly pro gly trp phe 
61/21 

GGA GAC GAC GGA TGG AAG GAG ATG CTG GCG 
gly asp asp gly trp lys glu met leu ala 
121/41 

AGT CGT CGT CTA CGG AGG TGA AGG CGG CGG 
ser arg arg leu arg arg OPA arg arg arg 
181/61 

AGA CTG GCC CGC AGG TGA AGG CGG CGG ATT 
arg leu ala arg arg OPA arg arg arg ile 
241/81 

AGG ACG CGT GCC CAG AAC AGG CCC TCG TCG 
arg thr arg ala gin asn arg pro ser ser 
301/101 

GGC TTG TTG GCA TTG CGG CGA CGC TGC TCG 
gly leu leu ala leu arg arg arg cys ser 
361/121 

ATT TTG CGT TGC GCT CAC ACC AGG AAA GCC 
ile leu arg cys ala his thr arg lys ala 
421/141 

TTG AGG CCG CTA AGG ATT GCG TTG CGG CCA 
leu arg pro leu arg ile ala leu arg pro 
481/161 

CTA GCA TGC AGA AGA TC 
leu ala cys arg arg 



31/11 

AAG TAC GCC GTC GCA AAC TCG AGC ACA ACA 
lys tyr ala val ala asn ser ser thr thr 
91/31 

CCG GCC AGC TGA ACC CTG CCG ATG CGA ATA 
pro ala ser OPA thr leu pro met arg ile 
151/51 

ATT CGG CGG AAT CTG ACG CCG GAG CCG ACC 
ile arg arg asn leu thr pro glu pro thr 
211/71 

CGG CGG AAT CTG ACG CCG GAG AGC TCG GCG 
arg arg asn leu thr pro glu ser ser ala 
271/91 

AGC GGC GCC CGT CGC GGT TGC GGC GAG GCT 
ser gly ala arg arg gly cys gly glu ala 
331/111 

CGT TGG CCG GTG GCC TTG GCG CAG CGG GTT 
arg trp pro val ala leu ala gin arg val 
391/131 

AAT CAA TCG CGC GCG AGG ACC TTG CGG CCA 
asn gin ser arg ala arg thr leu arg pro 
451/151 

CGC AGG CAC CCG ATG CTG GGG CGA TGT CGG 
arg arg his pro met leu gly arg cys arg 



SEQ ID No.38B 
FIGURE 38B 
REPLACEMENT SHEET (RULE 26) 




121/185 



1/1 


















31/11 


















AGC GGT 


GAA 


CTG 


GTG 


GGC 


CCG 


GAT 


GGT 


TCA 


AGT ACG 


CCG 


TCG 


CAA 


ACT 


CGA 


GCA 


CAA 


CAG 


ser gly 


alu 

J. 


leu 


val 


gly 


pro 


asp 


gly 


ser 


ser thr 


pro 


ser 


gin 


thr 


arg 


ala 


gin 


gin 


61/21 
















91/31 


















GAG ACG 


ACG 


GAT 


GGA 


AGG 


AGA 


TGC 


TGG 


CGC 


CGG CCA 


GCT 


GAA 


CCC 


TGC 


CGA 


TGC 


GAA 


TAA 


glu thr 


thr 


asp 


gly 


arg 


arg 


cys 


trp 


arg 


arg pro 


ala 


glu 


pro 


cys 


arg 


cys 


glu 


OCH 


121/41 


















151/51 


















GTC GTC 


GTC 


TAC 


GGA 


GGT 


GAA 


GGC 


GGC 


GGA 


TTC GGC 


GGA 


ATC 


TGA 


CGC 


CGG 


AGC 


CGA 


CCA 


val val 


val 


tyr 


gly 


gly 


glu 


gly 


gly 


gly 


phe gly 


gly 


ile 


OPA 


arg 


arg 


ser 


arg 


pro 


181/61 
















211/71 


















GAC TGG 


CCC 


GCA 


GGT 


GAA 


GGC 


GGC 


GGA 


TTC 


GGC GGA 


ATC 


TGA 


CGC 


CGG 


AGA 


GCT 


CGG 


CGA 


asp trp 


pro 


ala 


gly 


glu 


gly 


gly 


gly phe 


gly gly 


ile 


OPA 


arg 


arg 


arg 


ala 


arg 


arg 


241/81 
















271/91 


















GGA CGC 


GTG 


CCC 


AGA 


ACA 


GGC 


CCT 


CGT 


CGA 


GCG GCG 


CCC 


GTC 


GCG 


GTT 


GCG 


GCG 


AGG 


CTG 


gly arg 


val 


pro 


arg 


thr 


gly 


pro 


arg 


arg 


ala ala 


pro 


val 


ala 


val 


ala 


ala 


arg 


leu 


301/101 


















331/111 


















GCT TGT 


TGG 


CAT 


TGC 


GGC 


GAC 


GCT 


GCT 


CGC 


GTT GGC 


CGG 


TGG 


CCT 


TGG 


CGC 


AGC 


GGG 


TTA 


ala cys 


trp 


his 


cys 


gly 


asp 


ala 


ala 


arg 


val gly 


arg 


trp 


pro 


trp 


arg 


ser 


gly 


leu 


361/121 
















391/131 


















TTT TGC 


GTT 


GCG 


CTC 


ACA 


CCA 


GGA 


AAG 


CCA 


ATC AAT 


CGC 


GCG 


CGA 


GGA 


CCT 


TGC 


GGC 


CAT 


phe cys 


val 


ala 


leu 


thr 


pro 


gly 


lys 


pro 


ile asn 


arg 


ala 


arg 


gly 


pro 


cys 


gly 


his 


421/141 


















451/151 


















TGA GGC 


CGC 


TAA 


GGA 


TTG 


CGT 


TGC 


GGC 


CAC 


GCA GGC 


ACC 


CGA 


TGC 


TGG 


GGC 


GAT 


GTC 


GGC 


OPA gly 


arg 


OCH 


gly 


leu 


arg 


cys 


gly 


his 


ala gly 


thr 


arg 


cys 


trp 


gly 


asp 


val 


gly 


481/161 




































TAG CAT 


GCA 


GAA 


GAT 


C 




























AMB his 


ala 


glu 


asp 































SEQ ID No.38C 
FIGURE 38C 



REPLACEMENT SHEET (RULE 26) 



122/185 

Sequence Rv0175 predicted by Cole et al., 1998 (Nature 393 537-544) and 
containing seq38A 



1/1 31/11 



GTG AAG 


GCG 


GCG 


GAT 


TCG 


GCG 


GAA 


TCT 


GAC 


GCC GGA 


GCC 


GAC 


CAG 


ACT 


GGC 


CCG 


CAG 


GTG 


val lys 


ala 


ala 


asp 


ser 


ala 


glu 


ser 


asp 


ala gly 


ala 


asp 


gin 


thr 


gly 


pro 


gin 


val 


61/21 


















91/31 


















AAG GCG 


GCG 


GAT 


TCG 


GCG 


GAA 


TCT 


GAC 


GCC 


GGA GAG 


CTC 


GGC 


GAG 


GAC 


GCG 


TGC 


CCA 


GAA 


lys ala 


ala 


asp 


ser 


ala 


glu 


ser 


asp 


ala 


gly glu 


leu 


gly 


glu 


asp 


ala 


cys 


pro 


glu 


121/41 
















151/51 


















CAG GCC 


CTC 


GTC 


GAG 


CGG 


CGC 


CCG 


TCG 


CGG 


TTG CGG 


CGA 


GGC 


TGG 


CTT 


GTT 


GGC 


ATT 


GCG 


gin ala 


leu 


val 


glu 


arg 


arg 


pro 


ser 


arg 


leu arg 


arg 


gly 


trp 


leu 


val 


gly 


ile 


ala 


181/61 


















211/71 


















GCG ACG 


CTG 


CTC 


GCG 


TTG 


GCC 


GGT 


GGC 


CTT 


GGC GCA 


GCG 


GGT 


TAT 


TTT 


GCG 


TTG 


CGC 


TCA 


ala thr 


leu 


leu 


ala 


leu 


ala 


gly 


gly 


leu 


gly ala 


ala 


gly 


tyr 


phe 


ala 


leu 


arg 


ser 


241/81 


















271/91 


















CAC CAG 


GAA 


AGC 


CAA 


TCA 


ATC 


GCG 


CGC 


GAG 


GAC CTT 


GCG 


GCC 


ATT 


GAG 


GCC 


GCT 


AAG 


GAT 


his gin 


glu 


ser 


gin 


ser 


ile 


ala 


arg 


glu 


asp leu 


ala 


ala 


ile 


glu 


ala 


ala 


lys 


asp 


301/101 
















331/111 


















TGC GTT 


GCG 


GCC 


ACG 


CAG 


GCA 


CCC 


GAT 


GCT 


GGG GCG 


ATG 


TCG 


GCT 


AGC 


ATG 


CAG 


AAG 


ATC 


cys val 


ala 


ala 


thr 


gin 


ala 


pro 


asp 


ala 


gly ala 


met 


ser 


ala 


ser 


met 


gin 


lys 


ile 


361/121 


















391/131 


















ATC GAG 


TGT 


GGC 


ACC 


GGT 


GAT 


TTC 


GGT 


GCC 


CAG GCG 


TCG 


TTG 


1 AL. 


ALL 




J-\±. VJ 






ile glu 


cys 


gly 


thr 


gly 


asp 


phe 


gly 


ala 


gin ala 


ser 


leu 


tyr 


thr 


ser 


met 


leu 


val 


421/141 














451/151 


















GAG GCG 


TAT 


CAA 


GCG 


GCC 


AGC 


GTC 


CAC 


GTG 


CAA GTG 


ACC 


GAT 


ATG 


CGC 


GCG 


GCG 


GTC 


GAG 


glu ala 


tyr 


gin 


ala 


ala 


ser 


val 


his 


val 


gin val 


thr 


asp 


met 


arg 


ala 


ala 


val 


glu 


481/161 














511/171 


















CGC AAC 


AAC 


AAT 


GAC 


GGG 


TCG 


GTC 


GAT 


GTT 


CTG GTG 


GCG 


CTC 


CGG 


GTC 


AAG 


GTG 


TCC 


AAC 


arg asn 


asn 


asn 


asp 


gly 


ser 


val 


asp 


val 


leu val 


ala 


leu 


arg 


val 


lys 


val 


ser 


asn 


541/181 


















571/191 
















GAG 


ACC GAC 


TCG 


GAT 


GCC 


CAT 


GAA 


GTC 


GGC 


TAC 


CGT CTT 


CGG 


GTC 


CGG 


ATG 


GCA 


CTG 


GAT 


thr asp 


ser 


asp 


ala 


his 


glu 


val 


gly 


tyr 


arg leu 


arg 


val 


arg 


met 


ala 


leu 


asp 


glu 


601/201 
















631/211 


















GGC CGC 


TAT 


AAG 


ATC 


GCC 


AAA 


CTC 


GAC 


CAG 


GTG ACG 


AAG 


TGA 














gly arg 


tyr 


lys 


ile 


ala 


lys 


leu 


asp 


gin 


val thr 


lys 


OPA 















SEQ ID No.38D 
FIGURE 38D 



REPLACEMENT SHEET (RULE 26) 




ORF 



123/185 



according to Cole et al . , 1998 (Nature 393 537-544) containing Rv0175 



1/1 31/11 

TGA ACT GGT GGG GCC GGA TGG TGT CAA GTA CGC CGT CGC AAA CTC GAG CAC AAC AGG AGA 
OPA thr gly gly ala gly trp cys gin val arg arg arg lys leu glu his asn arg arg 
61/21 91/31 

CGA CGG ATG GAA GGA GAT GCT GGC GCC GGC CAG CTG AAC CCT GCC GAT GCG AAT AAG TCG 
arg arg met glu gly asp ala gly ala gly gin leu asn pro ala asp ala asn lys ser 
121/41 151/51 

TCG TCT ACG GAG GTG AAG GCG GCG GAT TCG GCG GAA TCT GAC GCC GGA GCC GAC CAG ACT 
ser ser thr glu val lys ala ala asp ser ala glu ser asp ala gly ala asp gin thr 
181/61 211/71 

GGC CCG CAG GTG AAG GCG GCG GAT TCG GCG GAA TCT GAC GCC GGA GAG CTC GGC GAG GAC 
gly pro gin val lys ala ala asp ser ala glu ser asp ala gly glu leu gly glu asp 
241/81 271/91 

GCG TGC CCA GAA CAG GCC CTC GTC GAG CGG CGC CCG TCG CGG TTG CGG CGA GGC TGG CTT 
ala cys pro glu gin ala leu val glu arg arg pro ser arg leu arg arg gly trp leu 
301/101 331/111 

GTT GGC ATT GCG GCG ACG CTG CTC GCG TTG GCC GGT GGC CTT GGC GCA GCG GGT TAT TTT 
val gly ile ala ala thr leu leu ala leu ala gly gly leu gly ala ala gly tyr phe 
361/121 391/131 

GCG TTG CGC TCA CAC CAG GAA AGC CAA TCA ATC GCG CGC GAG GAC CTT GCG GCC ATT GAG 
ala leu arg ser his gin glu ser gin ser ile ala arg glu asp leu ala ala ile glu 
421/141 451/151 

GCC GCT AAG GAT TGC GTT GCG GCC ACG CAG GCA CCC GAT GCT GGG GCG ATG TCG GCT AGC 
ala ala lys asp cys val ala ala thr gin ala pro asp ala gly ala met ser ala ser 
481/161 511/171 

ATG CAG AAG ATC ATC GAG TGT GGC ACC GGT GAT TTC GGT GCC CAG GCG TCG TTG TAC ACC 
met gin lys ile ile glu cys gly thr gly asp phe gly ala gin ala ser leu tyr thr 
541/181 571/191 

AGC ATG CTC GTC GAG GCG TAT CAA GCG GCC AGC GTC CAC GTG CAA GTG ACC GAT ATG CGC 
ser met leu val glu ala tyr gin ala ala ser val his val gin val thr asp met arg 
601/201 631/211 

GCG GCG GTC GAG CGC AAC AAC AAT GAC GGG TCG GTC GAT GTT CTG GTG GCG CTC CGG GTC 
ala ala val glu arg asn asn asn asp gly ser val asp val leu val ala leu arg val 
661/221 691/231 

AAG GTG TCC AAC ACC GAC TCG GAT GCC CAT GAA GTC GGC TAC CGT CTT CGG GTC CGG ATG 
lys val ser asn thr asp ser asp ala his glu val gly tyr arg leu arg val arg met 
721/241 751/251 

GCA CTG GAT GAG GGC CGC TAT AAG ATC GCC AAA CTC GAC CAG GTG ACG AAG TGA 
ala leu asp glu gly arg tyr lys ile ala lys leu asp gin val thr lys OPA 



SEQ ID No.38F 
FIGURE 38F 



REPLACEMENT SHEET (RULE 26) 




124/185 



1/1 31/11 

ACA CCT CCC CCC CCG CCG CCG CTG CCG CCG GTT CCC TTT CCC AAG GAA TGT CCG GCG CCG 
thr pro pro pro pro pro pro leu pro pro val pro phe pro lys glu cys pro ala pro 
61/21 91/31 

GGC GTG ATG CAA GGC TGC CTT GAG AGC ACC AGC GGC TTG ATC ATG GGC ATC GAC AGC AAG 
gly val met gin gly cys leu glu ser thr ser gly leu ile met gly ile asp ser lys 
121/41 151/51 

ACC GCA CTG GTC GCC GAG CGC ATC ACC GGT GCC GTC GAG GAG ATC 
thr ala leu val ala glu arg ile thr gly ala val glu glu ile 



SEQ ID No,39A 
FIGURE 39A 



1/1 

CAC CTC CCC CCC CGC 
his leu pro pro arg 
61/21 

GCG TGA TGC AAG GCT 
ala OPA cys lys ala 
121/41 

CCG CAC TGG TCG CCG 
pro his trp ser pro 



CGC CGC TGC CGC CGG 
arg arg cys arg arg 

GCC TTG AGA GCA CCA 
ala leu arg ala pro 

AGC GCA TCA CCG GTG 
ser ala ser pro val 



31/11 

TTC CCT TTC CCA AGG 
phe pro phe pro arg 
91/31 

GCG GCT TGA TCA TGG 
ala ala OPA ser trp 
151/51 

CCG TCG AGG AGA TC 
pro ser arg arg 



AAT GTC CGG CGC CGG 
asn val arg arg arg 

GCA TCG ACA GCA AGA 
ala ser thr ala arg 



SEQ ID No.39B 
FIGURE 39B 



1/1 31/11 

GAC ACC TCC CCC CCC GCC GCC GCT GCC GCC GGT TCC CTT TCC CAA GGA ATG TCC GGC GCC 

asp thr ser pro pro ala ala ala ala ala gly ser leu ser gin gly met ser gly ala 

61/21 91/31 

GGG CGT GAT GCA AGG CTG CCT TGA GAG CAC CAG CGG CTT GAT CAT GGG CAT CGA CAG CAA 

gly arg asp ala arg leu pro OPA glu his gin arg leu asp his gly his arg gin gin 

121/41 151/51 

GAC CGC ACT GGT CGC CGA GCG CAT. CAC CGG TGC CGT CGA GGA GAT C 

asp arg thr gly arg arg ala his his arg cys arg arg gly asp 



SEQ ID No.39C 
FIGURE 39C 



REPLACEMENT SHEET (RULE 26) 



4» 




125/185 

Coding sequence Rv3006 predicted by Cole et al., 1998 {Nature 393 537-544) and containing 
seq39A 

1/1 31/11 

ATG TGG ACA ACG CGG TTG GTT CGA TCC GGA CTC GCC GCG CTG TGC GCG GCA GTG CTG GTA 
Met trp thr thr arg leu val arg ser gly leu ala ala leu cys ala ala val leu val 
61/21 



61/21 ^x/jx 

TCG AGC GGC TGC GCA CGG TTC AAC GAC GCT CAA TCT CAG CCG TTC ACC ACC GAA CCG GAG 
ser ser gly cys ala arg phe asn asp ala gin ser gin pro phe thr thr glu pro glu 
121/41 151/51 

CTG CGG CCC CAA CCC AGC TCG ACA CCT CCC CCC CCG CCG CCG CTG CCG CCG GTT CCC TTT 
leu arg pro gin pro ser ser thr pro pro pro pro pro pro leu pro pro val pro phe 
181/61 211/71 

CCC AAG GAA TGT CCG GCG CCG GGC GTG ATG CAA GGC TGC CTT GAG AGC ACC AGC GGC TTG 
pro lys glu cys pro ala pro gly val met gin gly cys leu glu ser thr ser gly leu 
241/81 271/91 

ATC ATG GGC ATC GAC AGC AAG ACC GCA CTG GTC GCC GAG CGC ATC ACC GGT GCC GTC GAG 
ile met gly ile asp ser lys thr ala leu val ala glu arg ile thr gly ala val glu 
301/101 331/111 

GAG ATC TCT ATC AGC GCC GAG CCG AAG GTA AAG ACG GTC ATC CCC GTG GAT CCT GCC GGT 
qlu ile ser ile ser ala glu pro lys val lys thr val ile pro val asp pro ala gly 
361/121 391/131 

GAC GGT GGC TTG ATG GAC ATT GTG CTG TCG CCC ACC TAC TCG CAA GAC CGG CTG ATG TAC 
asp qlv gly leu met asp ile val leu ser pro thr tyr ser gin asp arg leu met tyr 
421/141 451/151 

GCC TAC ATC AGC ACG CCC ACC GAC AAC CGG GTG GTG CGA GTG GCC GAC GGC GAC ATC CCC 
ala tyr ile ser thr pro thr asp asn arg val val arg val ala asp gly asp lie pro 
481/161 511/171 

AAG GAC ATC CTG ACC GGC ATC CCC AAA GGT GCT GCC GGT AAC ACC GGG GCG CTG ATC TTC 
lys asp ile leu thr gly ile pro lys gly ala ala gly asn thr gly ala leu He phe 
541/181 571/191 

ACC AGT CCC ACC ACG CTG GTC GTG ATG ACC GGG GAT GCT GGC GAC CCG GCG TTG GCC GCC 
thr ser pro thr thr leu val val met thr gly asp ala gly asp pro ala leu ala ala 
601/201 631/211 

GAT CCC CAA TCG TTG GCC GGT AAG GTC CTG CGT ATC GAA CAG CCC ACC ACC ATC GGC CAG 
asp pro gin ser leu ala gly lys val leu arg ile glu gin pro thr thr ile gly gin 
661/221 691/231 

ACG CCG CCG ACG ACG GCG CTG TCT GGC ATC GGC TCC GGC GGC GGC TTG TGC ATC GAT CCG 
thr pro pro thr thr ala leu ser gly ile gly ser gly gly gly leu cys ile asp pro 
721/241 751/251 

GTC GAC GGC TCG CTA TAT GTC GCC GAC CGC ACG CCA ACG GCG GAC CGA TTG CAG CGC ATC 
val asp gly ser leu tyr val ala asp arg thr pro thr ala asp arg leu gin arg lie 
781/261 811/271 

ACC AAG AAC TCG GAG GTC TCT ACG GTA TGG ACC TGG CCG GAC AAG CCC GGC GTG GCC GGG 
thr lys asn ser glu val ser thr val trp thr trp pro asp lys pro gly val ala gly 
841/281 871/291 

TGT GCC GCG ATG GAC GGC ACC GTG CTG GTC AAC CTG ATT AAT ACC AAA CTG ACG GTG GCG 
cys ala ala met asp gly thr val leu val asn leu ile asn thr lys leu thr val ala 
901/301 931/311 

GTC CGG CTC GCG CCG TCG ACC GGT GCG GTC ACC GGA GAA CCC GAC GTT GTC CGC AAA GAC 
val arg leu ala pro ser thr gly ala val thr gly glu pro asp val val arg lys asp 
961/321 991/331 

ACT CAT GCG CAT GCG TGG GCA TTA CGG ATG TCG CCG GAC GGC AAC GTC TGG GGA GCC ACC 
thr his ala his ala trp ala leu arg met ser pro asp gly asn val trp gly ala thr 
1021/341 1051/351 

GTC AAC AAG ACC GCC GGC GAC GCC GAG AAG CTC GAC GAT GTG GTG TTC CCG CTG TTC CCG 

val asn lys thr ala gly asp ala glu lys leu asp asp val val phe pro leu phe pro 

1081/361 1111/371 

CAG GGT GGC GGC TTC CCG CGC AAC AAC GAC GAC AAG ACC TGA 

gin gly gly gly phe pro arg asn asn asp asp lys thr OPA 

SEQ ID No.39D 
FIGURE 39D 



REPLACEMENT SHEET (RULE 26) 



# 



126/185 



ORF according to Cole et al., 1998 (Nature 393 537-544) and containing Rv3006 
1/1 31/11 

TAA GGC CAT TTA GTG CCG AAT TGG GGA TTT GAG CGG CGC TTT CGC CAG ACA ATC CGC ACA 
OCH gly his leu val pro asn trp gly phe glu arg arg phe arg gin thr ile arg thr 
61/21 91/31 

TTG ACC CTG ACC AGC CCA CCA AAA GGC CCC AAT TGG GCC GCC ATG CCG ACA GTG CGC ACC 
leu thr leu thr ser pro pro lys gly pro asn trp ala ala met pro thr val arg thr 
121/41 151/51 

CCG GCA GGT GGC GGC GAT GCC CAC AAT GTC CGT AGC CTG TCG GTC ATG TGG ACA ACG CGG 
pro ala gly gly gly asp ala his asn val arg ser leu ser val met trp thr thr arg 
181/61 211/71 

TTG GTT CGA TCC GGA CTC GCC GCG CTG TGC GCG GCA GTG CTG GTA TCG AGC GGC TGC GCA 
leu val arg ser gly leu ala ala leu cys ala ala val leu val ser ser gly cys ala 
241/81 271/91 

CGG TTC AAC GAC GCT CAA TCT CAG CCG TTC ACC ACC GAA CCG GAG CTG CGG CCC CAA CCC 
arg phe asn asp ala gin ser gin pro phe thr thr glu pro glu leu arg pro gin pro 
301/101 331/111 

AGC TCG ACA CCT CCC CCC CCG CCG CCG CTG CCG CCG GTT CCC TTT CCC AAG GAA TGT CCG 
ser ser thr pro pro pro pro pro pro leu pro pro val pro phe pro lys glu cys pro 
361/121 391/131 

GCG CCG GGC GTG ATG CAA GGC TGC CTT GAG AGC ACC AGC GGC TTG ATC ATG GGC ATC GAC 
ala pro gly val met gin gly cys leu glu ser thr ser gly leu ile met gly ile asp 
421/141 451/151 

AGC AAG ACC GCA CTG GTC GCC GAG CGC ATC ACC GGT GCC GTC GAG GAG ATC TCT ATC AGC 
ser lys thr ala leu val ala glu arg ile thr gly ala val glu glu ile ser ile ser 
481/161 511/171 

GCC GAG CCG AAG GTA AAG ACG GTC ATC CCC GTG GAT CCT GCC GGT GAC GGT GGC TTG ATG 
ala glu pro lys val lys thr val ile pro val asp pro ala gly asp gly gly leu met 
541/181 571/191 

GAC ATT GTG CTG TCG CCC ACC TAC TCG CAA GAC CGG CTG ATG TAC GCC TAC ATC AGC ACG 
asp ile val leu ser pro thr tyr ser gin asp arg leu met tyr ala tyr ile ser thr 
601/201 631/211 

CCC ACC GAC AAC CGG GTG GTG CGA GTG GCC GAC GGC GAC ATC CCC AAG GAC ATC CTG ACC 
pro thr asp asn arg val val arg val ala asp gly asp ile pro lys asp ile leu thr 
661/221 691/231 

GGC ATC CCC AAA GGT GCT GCC GGT AAC ACC GGG GCG CTG ATC TTC ACC AGT CCC ACC ACG 
gly ile pro lys gly ala ala gly asn thr gly ala leu ile phe thr ser pro thr thr 
721/241 751/251 

CTG GTC GTG ATG ACC GGG GAT GCT GGC GAC CCG GCG TTG GCC GCC GAT CCC CAA TCG TTG 
leu val val met thr gly asp ala gly asp pro ala leu ala ala asp pro gin ser leu 
781/261 811/271 

GCC GGT AAG GTC CTG CGT ATC GAA CAG CCC ACC ACC ATC GGC CAG ACG CCG CCG ACG ACG 
ala gly lys val leu arg ile glu gin pro thr thr ile gly gin thr pro pro thr thr 
841/281 871/291 

GCG CTG TCT GGC ATC GGC TCC GGC GGC GGC TTG TGC ATC GAT CCG GTC GAC GGC TCG CTA 
ala leu ser gly ile gly ser gly gly gly leu cys ile asp pro val asp gly ser leu 
901/301 931/311 

TAT GTC GCC GAC CGC ACG CCA ACG GCG GAC CGA TTG CAG CGC ATC ACC AAG AAC TCG GAG 
tyr val ala asp arg thr pro thr ala asp arg leu gin arg ile thr lys asn ser glu 



SEQ ID No.39F 
FIGURE 39F 



REPLACEMENT SHEET (RULE 26) 



127/185 



961/321 
















991/331 
















GTC TCT ACG 


GTA 


TGG 


ACC 


TGG 


CCG 


GAC 


AAG 


CCC GGC GTG 


GCC 


GGG 


TGT 


GCC 


GCG 


ATG 


GAC 


val ser thr 


val 


trp 


thr 


trp 


pro 


asp 


lys 


pro gly val 


ala 


gly 


cys 


ala 


ala 


met 


asp 


1021/341 
















1051/351 














CCG 


GGC ACC GTG 


CTG 


GTC, AAC 


CTG 


ATT 


AAT 


ACC 


AAA CTG ACG 


GTG 


GCG 


GTC 


CGG 


CTC 


GCG 


gly thr val 


leu 


val 


asn 


leu 


ile 


asn 


thr 


lys leu thr 


val 


ala 


val 


arg 


leu 


ala 


pro 


1081/361 
















1111/371 














GCG 


TCG ACC GGT 


GCG 


GTC 


ACC 


GGA 


GAA 


CCC 


GAC 


GTT GTC CGC 


AAA 


GAC 


ACT 


CAT 


GCG 


CAT 


ser thr gly 


ala 


val 


thr 


gly 


glu 


pro 


asp 


val val arg 


lys 


asp 


thr 


his 


ala 


his 


ala 


1141/381 
















1171/391 
















TGG GCA TTA 


CGG 


ATG 


TCG 


CCG 


GAC 


GGC 


AAC 


GTC TGG GGA 


GCC 


ACC 


GTC 


AAC 


AAG 


ACC 


GCC 


trp ala leu 


arg 


met 


ser 


pro 


asp 


gly 


asn 


val trp gly 


ala 


thr 


val 


asn 


lys 


thr 


ala 


1201/401 














1231/411 
















GGC GAC GCC 


GAG 


AAG 


CTC 


GAC 


GAT 


GTG 


GTG 


TTC CCG CTG 


TTC 


CCG 


CAG 


GGT 


GGC 


GGC 


TTC 


gly asp ala 


glu 


lys 


leu 


asp 


asp 


val 


val 


phe pro leu 


phe 


pro 


gin 


gly 


gly 


gly 


phe 


1261/421 
































CCG CGC AAC 


AAC 


GAC 


GAC 


AAG 


ACC 


TGA 




















pro arg asn 


asn 


asp 


asp 


lys 


thr 


OP A 





















SEQ ID No.39F (continued) 
FIGURE 39F (continued) 



l/l 

GAA GGC CTT GTT GAG CCG GCG 
glu gly leu val glu pro ala 
61/21 

TCG GTT GAA CGT GTA TGT GCC 
ser val glu arg val cys ala 
121/41 

CGT CTC GGC GCT GAC TCA GGC 
arg leu gly ala asp ser gly 
181/61 

GTG GCT TGA GGG GTT GGA ACC 
val ala OPA gly val gly thr 
241/81 

GAT CGA TGC CGC TCG CGA TGA 
asp arg cys arg ser arg OPA 
301/101 

CAG GTG GTC GTC GAC GCG AGT 
gin val val val asp ala ser 









31/11 




CAC 


GAA 


AAC 


GAT CGT 


TGT 


his 


glu 


asn 


asp arg 


cys 








91/31 




CGA 


CGA 


ATT 


GGC GGA 


GCG 


arg 


arg 


ile 


gly gly 


ala 








151/51 




CGC 


GAT 


CAG 


TGC CGA 


GTT 


arg 


asp 


gin 


cys arg 


val 








211/71 




CAG 


AAG 


CAC 


CGG CGC 


TCG 


gin 


lys 


his 


arg arg 


ser 








271/91 




GTT 


CGA 


AGC 


GTG AGA 


GCA 


val 


arg 


ser 


val arg 


ala 








331/111 




GCC 


ATG 


GTG 


GAT C 




ala 


met 


val 


asp 





SEQ ID No ,4 OA 
FIGURE 4 OA 



GTG TAC ATT GGT GTG TAT GGC 
val tyr ile gly val tyr gly 

CGC CAG GGC GCG GGG CTT GAA 
arg gin gly ala gly leu glu 

GGA GAA CTC CGC AAC CGA TGC 
gly glu leu arg asn arg cys 

GCA TGA TGA CGT GCT GGG TGC 
ala OPA OPA arg ala gly cys 

TCG CCC ACT TCG CCG CCG GAG 
ser pro thr ser pro pro glu 



REPLACEMENT SHEET (RULE 26) 



# 



128/185 



1/1 

AAG GCC TTG TTG AGC CGG CGC 
lys ala leu leu ser arg arg 
61/21 

CGG TTG AAC GTG TAT GTG CCC 
arg leu asn val tyr val pro 
121/41 

GTC TCG GCG CTG ACT CAG GCC 
val ser ala leu thr gin ala 
181/61 

TGG CTT GAG GGG TTG GAA CCC 
trp leu glu gly leu glu pro 
241/81 

ATC GAT GCC GCT CGC GAT GAG 
ile asp ala ala arg asp glu 
301/101 

AGG TGG TCG TCG ACG CGA GTG 
arg trp ser ser thr arg val 









31/11 




ACG 


AAA 


ACG 


ATC GTT 


GTG 


thr 


lys 


thr 


ile val 


val 








91/31 




GAC 


GAA 


TTG 


GCG GAG 


CGC 


asp 


glu 


leu 


ala glu 


arg 








151/51 




GCG 


ATC 


AGT 


GCC GAG 


TTG 


ala 


ile 


ser 


ala glu 


leu 








211/71 




AGA 


AGC 


ACC 


GGC GCT 


CGG 


arg 


ser 


thr 


gly ala 


arg 








271/91 




TTC 


GAA 


GCG 


TGA GAG 


CAT 


phe 


glu 


ala 


OPA glu 


his 








331/111 




CCA 


TGG 


TGG 


ATC 




pro 


trp 


trp 


ile 





SEQ ID No. 4 OB 
FIGURE 40B 



TGT ACA TTG GTG TGT ATG GCT 
cys thr leu val cys met ala 

GCC AGG GCG CGG GGC TTG AAC 
ala arg ala arg gly leu asn 

GAG AAC TCC GCA ACC GAT GCG 
glu asn ser ala thr asp ala 

CAT GAT GAC GTG CTG GGT GCG 
his asp asp val leu gly ala 

CGC CCA CTT CGC CGC CGG AGC 
arg pro leu arg arg arg ser 



1/1 

AGG CCT TGT TGA GCC 
arg pro cys OPA ala 
61/21 

GGT TGA ACG TGT ATG 
gly OPA thr cys met 
121/41 

TCT CGG CGC TGA CTC 
ser arg arg OPA leu 
181/61 

GGC TTG AGG GGT TGG 
gly leu arg gly trp 
241/81 

TCG ATG CCG CTC GCG 
ser met pro leu ala 
301/101 

GGT GGT CGT CGA CGC 
gly gly arg arg arg 



GGC GCA CGA AAA CGA 
gly ala arg lys arg 

TGC CCG ACG AAT TGG 
cys pro thr asn trp 

AGG CCG CGA TCA GTG 
arg pro arg ser val 

AAC CCA GAA GCA CCG 
asn pro glu ala pro 

ATG AGT TCG AAG CGT 
met ser ser lys arg 

GAG TGC CAT GGT GGA 
glu cys his gly gly 



31/11 

TCG TTG TGT GTA CAT 
ser leu cys val his 
91/31 

CGG AGC GCG CCA GGG 
arg ser ala pro gly 
151/51 

CCG AGT TGG AGA ACT 
pro ser trp arg thr 
211/71 

GCG CTC GGC ATG ATG 
ala leu gly met met 
271/91 

GAG AGC ATC GCC CAC 
glu ser ile ala his 

TC 



TGG TGT GTA TGG CTC 
trp cys val trp leu 

CGC GGG GCT TGA ACG 
arg gly ala OPA thr 

CCG CAA CCG ATG CGT 
pro gin pro met arg 

ACG TGC TGG GTG CGA 
thr cys trp val arg 

TTC GCC GCC GGA GCA 
phe ala ala gly ala 



SEQ ID NO.40C 
FIGURE 40C 



REPLACEMENT SHEET (RULE 26) 



129/185 



Coding sequence Rv0549c predicted by Cole et al., 1998 (Nature 393:537-544) and 
containing seq40A 



1/1 

gtg aga 
val arg 
61/21 
gat eta 
asp leu 
121/41 
gcg atg 
ala met 
181/61 
cgc gec 
arg ala 
241/81 
ccg gtg 
pro val 
301/101 
etc cgc 
leu arg 
361/121 
acc acc 
thr thr 



gca teg ccc act teg ccg ccg gag 
ala ser pro thr ser pro pro glu 

ctg get cgc act age gat egg tgc 
leu ala arg thr ser asp arg cys 

cac gcg ccg gcg cac ttc gat gca 
his ala pro ala his phe asp ala 

ggc gca etc acc gtt gee tat gtc 
gly ala leu thr val ala tyr val 

act cga cac ggt ctt teg teg ctg 
thr arg his gly leu ser ser leu 

ctg acc gat gec etc tac gtc gag 
leu thr asp ala leu tyr val glu 

gac gaa aga ttg gca cgc gee tgg 
asp glu arg leu ala arg ala trp 



31/11 
cag gtg 
gin val 
91/31 
tct gcg 
ser ala 
151/51 
gag gtg 
glu val 
211/71 
gat gcg 
asp ala 
271/91 
ctt get 
leu ala 
331/111 
ctg gee 
leu ala 
391/131 
ccc teg 
pro ser 



gtc 
val 

gtg 
val 

ttg 
leu 



gca 
ala 



gga 
gly 

gaa 
glu 

get 
ala 



gtc gac 
val asp 

cgc gcg 
arg ala 

teg gcg 
ser ala 

ctg gag 
leu glu 

gcg tgg 
ala trp 

acg gca 
thr ala 

cac gee 
his ala 



gcg agt 
ala ser 

egg ctg 
arg leu 

ctg ggg 
leu gly 

gag ttg 
glu leu 

teg cgc 
ser arg 

ggt ctg 
gly leu 

ate ggc 
ile gly 



gee atg gtg 
ala met val 

get egg acc 
ala arg thr 

cgc atg cag 
arg met gin 

cga cag gtg 
arg gin val 

cgc gac acc 
arg asp thr 

gtg ttg ttg 
val leu leu 

tga * 
OPA 



SEQ ID NO.4 0D 
FIGURE 40D 



ORF according to Cole et al . , 1998 (Nature 393:537-544) and containing Rv0549c 
1/1 31/11 

tga gtt cga age gtg aga gca teg ccc act teg ccg ccg gag cag gtg gtc gtc gac gcg 
OPA val arg ser val arg ala ser pro thr ser pro pro glu gin val val val asp ala 
61/21 91/31 

agt gec atg gtg gat eta ctg' get cgc act age gat egg tgc tct gcg gtg cgc gcg egg 
ser ala met val asp leu leu ala arg thr ser asp arg cys ser ala val arg ala arg 
121/41 151/51 

ctg get egg acc gcg atg cac gcg ccg gcg cac ttc gat gca gag gtg ttg teg gcg ctg 
leu ala arg thr ala met his ala pro ala his phe asp ala glu val leu ser ala leu 
181/61 211/71 

ggg cgc atg cag cgc gee ggc gca etc acc gtt gee tat gtc gat gcg gca ctg gag gag 
gly arg met gin arg ala gly ala leu thr val ala tyr val asp ala ala leu glu glu 
241/81 271/91 

ttg cga cag gtg ccg gtg act cga cac ggt ctt teg teg ctg ctt get gga gcg tgg teg 
leu arg gin val pro val thr arg his gly leu ser ser leu leu ala gly ala trp ser 
301/101 331/111 

cgc cgc gac acc etc cgc ctg acc gat gee etc tac gtc gag ctg gee gaa acg gca ggt 
arg arg asp thr leu arg leu thr asp ala leu tyr val glu leu ala glu thr ala gly 
361/121 391/131 

ctg gtg ttg ttg acc acc gac gaa aga ttg gca cgc gee tgg ccc teg get cac gec arc 
leu val leu leu thr thr asp glu arg leu ala arg ala trp pro ser ala hxs ala xle 



421/141 
ggc tga 
gly OPA 



SEQ ID No. 4 OF 



FIGURE 40F 
REPLACEMENT SHEET (RULE 26) 





130/185 



1/1 31/11 

CCT GGC CGG GAC GCC TAG GTG TAG CCC GCG GCT AGC ACA GGA TAG CCA TTG TTG TGC GGT 
pro gly arg asp ala tyr val AMB pro ala ala ser thr gly AMB pro leu leu cys gly 
61/21 91/31 

AGC GCC AAA ACG ATC AGC CCT TCG CGG ACA TGT CAG CAC CCG CCT TGG CCG GGA GAG CGG 
ser ala lys thr ile ser pro ser arg thr cys gin his pro pro trp pro gly glu arg 
121/41 151/51 

CGT CGT GAC CGT GCT GTC ACC ACG TCT GGT TAG GCT CGG GGC GCG GGC TGG CGC GGA GGA 
arg arg asp arg ala val thr thr ser gly AMB ala arg gly ala gly trp arg gly gly 
181/61 211/71 

GGT GTG TTG CGG AGG AGG TGT GTT GTA GTG GGG ACG GCG GAT CGG CCG TTG GAC GCC TCG 
gly val leu arg arg arg cys val val val gly thr ala asp arg pro leu asp ala ser 
241/81 271/91 
GCC TTG CGG GAC TGG GCA CAC GCC GTC GTC AGC GAT C 
ala leu arg asp trp ala his ala val val ser asp 



SEQ ID No.41A 
FIGURE 41A 



1/1 31/11 

CTG GCC GGG ACG CCT ACG TGT AGC CCG CGG CTA GCA CAG GAT AGC CAT TGT TGT GCG GTA 

leu ala gly thr pro thr cys ser pro arg leu ala gin asp ser his cys cys ala val 

61/21 91/31 

GCG CCA AAA CGA TCA GCC CTT CGC GGA CAT GTC AGC ACC CGC CTT GGC CGG GAG AGC GGC 

ala pro lys arg ser ala leu arg gly his val ser thr arg leu gly arg glu ser gly 

121/41 151/51 



121/41 j.-'j./ 

GTC GTG ACC GTG CTG TCA CCA CGT CTG GTT AGG CTC GGG GCG CGG GCT GGC GCG GAG GAG 

val val thr val leu ser pro arg leu val arg leu gly ala arg ala gly ala glu glu 

181/61 211/71 

GTG TGT TGC GGA GGA GGT GTG TTG TAG TGG GGA CGG CGG ATC GGC CGT TGG ACG CCT CGG 

val cys cys gly gly gly val leu AMB trp gly arg arg ile gly arg trp thr pro arg 

~ . - OT1 /Q1 



241/81 271/91 
CCT TGC GGG ACT GGG CAC ACG CCG TCG TCA GCG ATC 
pro cys gly thr gly his thr pro ser ser ala ile 



SEQ ID No.41B 
FIGURE 41B 



REPLACEMENT SHEET (RULE 26) 




131/185 



1/1 

X / J. 


















31/11 


















TGG CCG 


GGA 


CGC 


CTA 


CGT 


GTA 


GCC 


CGC 


GGC 


TAG CAC 


AGG 


ATA 


GCC 


ATT 


GTT 


GTG 


CGG 


TAG 


trp pro 


gly 


arg 


leu 


arg 


val 


ala 


arg 


gly 


AMB his 


arg 


ile 


ala 


ile 


val 


val 


arg 


AMB 


61/21 
















91/31 


















CGC CAA 


AAC 


GAT 


CAG 


CCC 


TTC 


GCG 


GAC 


ATG 


TCA GCA 


CCC 


GCC 


TTG 


GCC 


GGG 


AGA 


GCG 


GCG 


arg gin 


asn 


asp 


gin 


pro 


phe 


ala 


asp 


met 


ser ala 


pro 


ala 


leu 


ala 


gly 


arg 


ala 


ala 


121/41 


















151/51 


















TCG TGA 


CCG 


TGC 


TGT 


CAC 


CAC 


GTC 


TGG 


TTA 


GGC TCG 


GGG 


CGC 


GGG 


CTG 


GCG 


CGG 


AGG 


AGG 


ser OPA 


pro 


cys 


cys 


his 


his 


val 


trp 


leu 


gly ser 


gly 


arg 


gly 


leu 


ala 


arg 


arg 


arg 


181/61 
















211/71 


















TGT GTT 


GCG 


GAG 


GAG 


GTG 


TGT 


TGT 


AGT 


GGG 


GAC GGC 


GGA 


TCG 


GCC 


GTT 


GGA 


CGC 


CTC 


GGC 


cys val 


ala 


glu 


glu 


val 


cys 


cys 


ser 


gly 


asp gly 


gly 


ser 


ala 


val 


gly 


arg 


leu 


gly 


241/81 
















271/91 


















CTT GCG 


GGA 


CTG 


GGC 


ACA 


CGC 


CGT 


CGT 


CAG 


CGA TC 


















leu ala 


gly 


leu 


gly 


thr 


arg 


arg 


arg 


gin 


arg 



















SEQ ID No.41C 
FIGURE 41C 



Coding sequence Rv2975c predicted by Cole et al, 1998 (Nature 393: 537-544) and 
containing seq41A 

31/11 

ttg gac gcc teg gec ttg egg gac tgg gca cac gec gtc 
leu asp ala ser ala leu arg asp trp ala his ala val 
91/31 

ate gac gag ate aac egg etc aat gtg ttc ccg gtc get 
ile asp glu ile asn arg leu asn val phe pro val ala 
151/51 

atg ctg ttc acc atg cgt gcc gcg gtc gta gaa get gat 
met leu phe thr met arg ala ala val val glu ala asp 
211/71 

gac gcc gaa gac gtg gcg egg gtt gcg gcc get etc gcg 
asp ala glu asp val ala arg val ala ala ala leu ala 



SEQ ID No. 4 ID 
FIGURE 4 ID 



REPLACEMENT SHEET (RULE 26) 



gtg ggg acg gcg gat egg ccg 
val gly thr ala asp arg pro 
61/21 

gtc age gat ctg ate etc cac 
val ser asp leu ile leu his 
121/41 

gac tec gat acc ggc gtc aac 
asp ser asp thr gly val asn 
181/61 

ttg cac gcg aat teg cag get 
leu his ala asn ser gin ala 
241/81 

gcc ggc gcg cgt tga 
ala gly ala arg OPA 



# 



132/185 



ORF according to Cole et al, 1998 (Nature 393: 537-544) and containing Rv2975c 
1/1 31/11 

tag get egg ggc gcg ggc tgg cgc gga gga ggt gtg ttg egg agg agg tgt gtt gta gtg 
AMB ala arg gly ala gly trp arg gly gly gly val leu arg arg arg cys val val val 
61/21 91/31 

ggg acg gcg gat egg ccg ttg gac gcc teg gcc ttg egg gac tgg gca cac gcc gtc gtc 
gly thr ala asp arg pro leu asp ala ser ala leu arg asp trp ala his ala val val 
121/41 151/51 

age gat ctg ate etc cac ate gac gag ate aac egg etc aat gtg ttc ccg gtc get gac 
ser asp leu ile leu his ile asp glu ile asn arg leu asn val phe pro val ala asp 
181/61 211/71 

tec gat acc ggc gtc. aac atg ctg ttc acc atg cgt gcc gcg gtc gta gaa get gat ttg 
ser asp thr gly val asn met leu phe thr met arg ala ala val val glu ala asp leu 
241/81 271/91 

cac gcg aat teg cag get gac gcc gaa gac gtg gcg egg gtt gcg gcc get etc gcg gcc 
his ala asn ser gin ala asp ala glu asp val ala arg val ala ala ala leu ala ala 
301/101 

ggc gcg cgt tga 
gly ala arg OPA 

SEQ ID No.41F 
FIGURE 41F 



sequence Rv 2974C predicted by Cole et al. 
in the same reading frame as Seq41D. The 
one case out of three, a deletion of two 
in 



(Nature 393:537-544) and which may be 
sequencing of this region reveals, in 
nucleotides putting in phase observed 



1/1 

ttg aac gga get cgc ggc 
leu asn gly ala arg gly 
61/21 

gag gtg acc gcg act gcg 
glu val thr ala thr ala 
121/41 

gcc etc ggg gcc gcg ttg 
ala leu gly ala ala leu 
181/61 

gag gtg ccg gga act ate 
glu val pro gly thr ile 
241/81 

gcg cac gag ggg ttg gcc 
ala his glu gly leu ala 
301/101 

gaa aag acc ccc gaa cag 
glu lys thr pro glu gin 











31/11 




aac 


tec 


ggc 


gtg 


ate ctg 


tec 


asn 


ser 


gly 


val 


ile leu 


ser 










91/31 




gcc 


gcc 


gcc 


tct 


ggc gcg 


gta 


ala 


ala 


ala 


ser 


gly ala 


val 










151/51 




tgg 


cgc 


ggc 


gtc 


gag ttg 


gtc 


trp 


arg 


gly 


val 


glu leu 


val 










211/71 




gtc 


teg 


gtg 


ctg 


egg gcc 


gcc 


val 


ser 


val 


leu 


arg ala 


ala 










271/91 




ggt 


gcg 


gtc 


acc 


gcc gcc 


ggt 


gly 


ala 


val 


thr 


ala ala 


gly 








331/111 




ctt 


gac 


gtg 


etc 


gcc gat 


gcg 


leu 


asp 


val 


leu 


ala asp 


ala 



SEQ ID No • 4 IS 
FIGURE 41S 



cag 
gin 


ate 
ile 


ctg 
leu 


cgc 
arg 


ggg 

gly 


ate 
ile 


gca 
ala 


ttg 
leu 


egg 
arg 


gcg 
ala 


gtc 
val 


gac 
asp 


gcc 
ala 


aac 
asn 


gtc 
val 


gcg 
ala 


teg 
ser 


atg 
met 


ggt 
gly 


ggc 
gly 


gtg 
val 


gcc 
ala 


gga 
gly 


gcc 
ala 


gtc 
val 


gac 
asp 


cag 
gin 


tgc 
cys 


gac 
asp 


gcg 
ala 


gcg 
ala 


gtc 
val 


ate 
ile 


gcg 
ala 


ctg 
leu 


ggc 
gly 


gcg 
ala 


gtg 
val 


gac 
asp 


gcc 
ala 


ggc 
gly 


gga 
gly 



REPLACEMENT SHEET (RULE 26) 




133/185 



361/121 
egg ggc ctg 
arg gly leu 
421/141 
egg gcg gtc 
arg ala val 
481/161 
gec ccg caa 
ala pro gin 
541/181 
ttg egg gat 
leu arg asp 
601/201 
age tac tec 
ser tyr ser 
661/221 
gtg ggg cga 
val gly arg 
721/241 
gee ggt ggc 
ala gly gly 
781/261 
gag ctg ttc 
glu leu phe 
841/281 
gee gee gat 
ala ala asp 
901/301 
gtg atg gtg 
val met val 
961/321 
gcg ate ggc 
ala ile gly 
1021/341 
gee gcg ctg 
ala ala leu 
1081/361 
cgt gee gee 
arg ala ala 
1141/381 
tgg gee ggt 
trp ala gly 
1201/401 
gtc gec gac 
val ala asp 
1261/421 
ggc gat ctg 
gly asp leu 
1321/441 
gaa egg cat 
glu arg his 
1381/461 
cgc ggc gac 
arg gly asp 



ctg gtt 
leu val 

tac gaa 
tyr glu 

ttc gag 
phe glu 

cga etc 
arg leu 

gta cac 
val his 

gtt age 
val ser 

tgg acg 
trp thr 

gee ggg 
ala gly 

ate agt 
ile ser 

ctg ccc 
leu pro 

tgg ggc 
trp gly 

gee gtg 
ala val 

ggt get 
gly ala 

acc tgc 
thr cys 

gat gtc 
asp val 

gtg acg 
val thr 

gtg cac 
val his 

gcg ctg 
ala leu 



ctg ctg 
leu leu 

ccc teg 
pro ser 

gtg atg 
val met 

aag gaa 
lys glu 

gtc cac 
val his 

egg ate 
arg ile 

egg ggc 
arg gly 

gag ggc 
glu gly 

gee cac 
ala his 

aat ggc 
asn gly 

gtc gac 
val asp 

cat gac 
his asp 

tec egg 
ser arg 

aag ccg 
lys pro 

gec gcg 
ala ala 

gtg eta 
val leu 

gac cac 
asp his 

ctg ate 
leu ile 



gac gcg 
asp ala 

ccg cgc 
pro arg 

tat ctg 
tyr leu 

ttg ggt 
leu gly 

acc gac 
thr asp 

gtg ate 
val ile 

cgc gee 
arg ala 

gee tgc 
ala cys 

cag ctg 
gin leu 

tat gtg 
tyr val 

gtg gta 
val val 

gcg gee 
ala ala 

cac gga 
his gly 

ggc gac 
gly asp 

gcg gee 
ala ala 

att ggc 
ile gly 

cat cca 
his pro 

ggg gtc 
gly val 



391/131 
ttg cgc tec acc 
leu arg ser thr 

451/151 
gcg ttg ccg acc 
ala leu pro thr 

511/171 
ttg gcg gta tgt 
leu ala val cys 

571/191 
gag teg gtg gec 
glu ser val ala 

631/211 
gac gee ggt gee 
asp ala gly ala 

691/231 
teg gcg etc ggt 
ser ala leu gly 

751/251 
gtg ctg gcg gtc 
val leu ala val 

811/271 
gtg ctg cga ccg 
val leu arg pro 

871/291 
gtg egg gec gtg 
val arg ala val 

931/311 
gec gec gaa gaa 
ala ala glu glu 

991/331 
ccc gtg ccg acc 
pro val pro thr 

1051/351 
cgc cag gee gtc 
arg gin ala val 

1111/371 
teg gtg cgc att 
ser val arg ile 

1171/391 
ggt ctg ggt ate 
gly leu gly ile 

1231/411 
ate ggt ctg gtc 
ile gly leu val 

1291/431 
gee ggc gta acc 
ala gly val thr 

1351/451 
ggc acc gag ctg 
gly thr glu leu 

1411/471 
gag tag 
glu AMB 



ate tgc ggg cag gca cct gec 
ile cys gly gin ala pro ala 

gac acg get acc caa cgc ccc 
asp thr ala thr gin arg pro 

gat get gca gcg gcg gac cag 
asp ala ala ala ala asp gin 

ate gec get get ccg ccc gac 
ile ala ala ala pro pro asp 

gec gtg gaa gee gga ttg gcg 
ala val glu ala gly leu ala 

tec ggg acc age gga ttg ccg 
ser gly thr ser gly leu pro 

gtc gac ggc gac ggt gec gec 
val asp gly asp gly ala ala 

ggt cca gac gee gtg "aca ccg 
gly pro asp ala val thr pro 

gta gac acc ggc gee gcg cac 
val asp thr gly ala ala his 

ctg gtg gee ggg tgt acc gcg 
leu val ala gly cys thr ala 

gga teg atg gtg cag ggg ttg 
gly ser met val gin gly leu 

gac gac ggc tac age atg gec 
asp asp gly tyr ser met ala 

gec acc caa aag gcg ctg acc 
ala thr gin lys ala leu thr 

gcg ggc gac gag gtg ctg ate 
ala gly asp glu val leu ile 

gac ctg ttg ttg gca teg gga 
asp leu leu leu ala ser gly 

gaa gac gtg get gtc gtc ctg 
glu asp val ala val val leu 

gtc tec tac cgc acc gga cac 
val ser tyr arg thr gly his 



SEQ ID No. 4 IS (continued) 
FIGURE 4 IS (continued) 



REPLACEMENT SHEET (RULE 26) 



134/185 

Seq41T comprising seq 41F and seq 41S 
1/1 31/11 

tta ggc teg ggg cgc ggg ctg gcg egg agg agg tgt gtt gcg gag gag gtg tgt tgt agt 
leu gly ser gly arg gly leu ala arg arg arg cys val ala glu glu val cys cys ser 
AMB ala arg gly ala gly trp arg gly gly gly val leu arg arg arg cys val val val 

arg leu gly ala arg ala gly ala glu glu val cys cys gly gly gly val leu AMB trp 
61/21 91/31 • • 

ggg gac ggc gga teg gee gtt gga cgc etc ggc ctt gcg gga ctg ggc aca cgc cgt cgt 
gly asp gly gly ser ala val gly arg leu gly leu ala gly leu gly thr arg arg arg 
gly thr ala asp arg pro leu asp ala ser ala leu arg asp trp ala his ala val val 
gly arg arg ile gly arg trp thr pro arg pro cys gly thr gly his thr pro ser ser 
121/41 151/51 

cag cga tct gat cct cca cat cga cga gat caa ccg get caa tgt gtt ccc ggt cgc tga 
gin arg ser asp pro pro his arg arg asp gin pro ala gin cys val pro gly arg OPA 
ser asp leu ile leu his ile asp glu ile asn arg leu asn val phe pro val ala asp 
ala ile OPA ser ser thr ser thr arg ser thr gly ser met cys ser arg ser leu thr 
181/61 211/71 

etc cga tac egg cgt caa cat get gtt cac cat gcg tgc cgc ggt cgt aga age tga ttt 
leu arg tyr arg arg gin his ala val his his ala cys arg gly arg arg ser OPA phe 
ser asp thr gly val asn met leu phe thr met arg ala ala val val glu ala asp leu 
pro ile pro ala ser thr cys cys ser pro cys val pro arg ser AMB lys leu ile cys 
241/81 271/91 

gca cgc gaa ttc gca ggc tga cgc cga aga cgt ggc gcg ggt tgc ggc cgc tct cgc ggc 
ala arg glu phe ala gly OPA arg arg arg arg gly ala gly cys gly arg ser arg gly ' 
his ala asn ser gin ala asp ala glu asp val ala arg val ala ala ala leu ala ala 
thr arg ile arg arg leu thr pro lys thr trp arg gly leu arg pro leu ser arg pro 
301/101 331/111 

egg cgc gcg ttg aac gga get cgc ggc aac tec ggc gtg ate ctg tec cag ate ctg cgc 
arg arg ala leu asn gly ala arg gly asn ser gly val ile leu ser gin ile leu arg 
gly ala arg OPA thr glu leu ala ala thr pro ala OPA ser cys pro arg ser cys ala 
ala arg val glu arg ser ser arg gin leu arg arg asp pro val pro asp pro ala arg 
361/121 391/131 

ggg ate gca gag gtg acc gcg act gcg gee gee gec tct ggc gcg gta ttg egg gcg gtc 
gly ile ala glu val thr ala thr ala ala ala ala ser gly ala val leu arg ala val 
gly ser gin arg OPA pro arg leu arg pro pro pro leu ala arg tyr cys gly arg ser 
asp arg arg gly asp arg asp cys gly arg arg leu trp arg gly ile ala gly gly arg 
421/141 451/151 

gac gee aac gec etc ggg gee gcg ttg tgg cgc ggc gtc gag ttg gtc gtc gcg teg atg 
asp ala asn ala leu gly ala ala leu trp arg gly val glu leu val val ala ser met 
thr pro thr pro ser gly pro arg cys gly ala ala ser ser trp ser ser arg arg trp 
arg gin arg pro arg gly arg val val ala arg arg arg val gly arg arg val asp gly 
481/161 511/171 

ggt ggc gtg gag gtg ccg gga act ate gtc teg gtg ctg egg gee gee gee gga gee gtc 
gly gly val glu val pro gly thr ile val ser val leu arg ala ala ala gly ala val 
val ala trp arg cys arg glu leu ser ser arg cys cys gly pro pro pro glu pro ser 
trp arg gly gly ala gly asn tyr arg leu gly ala ala gly arg arg arg ser arg arg 
541/181 571/191 

gac cag tgc gcg cac gag ggg ttg gee ggt gcg gtc acc gee gee ggt gac gcg gcg gtc 
asp gin cys ala his glu gly leu ala gly ala val thr ala ala gly asp ala ala val 
thr ser ala arg thr arg gly trp pro val arg ser pro pro pro val thr arg arg ser 
pro val arg ala arg gly val gly arg cys gly his arg arg arg OPA arg gly gly his 
601/201 631/211 

ate gcg ctg gaa aag acc ccc gaa cag ctt gac gtg etc gee gat gcg ggc gcg gtg gac 
ile ala leu glu lys thr pro glu gin leu asp val leu ala asp ala gly ala val asp 
ser arg trp lys arg pro pro asn ser leu thr cys ser pro met arg ala arg trp thr 
arg ala gly lys asp pro arg thr ala OPA arg ala arg arg cys gly arg gly gly arg 
661/221 691/231 

gee ggc gga egg ggc ctg ctg gtt ctg ctg gac gcg ttg cgc tec acc ate tgc ggg cag 
ala gly gly arg gly leu leu val leu leu asp ala leu arg ser thr ile cys gly gin 
pro ala asp gly ala cys trp phe cys trp thr arg cys ala pro pro ser ala gly arg 
arg arg thr gly pro ala gly ser ala gly arg val ala leu his his leu arg ala gly 



SEQ ID No. 4 IT 
FIGURE 41T 
REPLACEMENT SHEET (RULE 26) 
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721/241 751/251 

gca cct gcc egg gcg gtc tac gaa ccc teg ccg cgc gcg ttg ccg acc gac acg get ace 
ala pro ala arg ala val tyr glu pro ser pro arg ala leu pro thr asp thr ala thr 
his leu pro gly arg ser thr asn pro arg arg ala arg cys arg pro thr arg leu pro 
thr cys pro gly gly leu arg thr leu ala ala arg val ala asp arg his gly tyr pro 
781/261 811/271 

caa cgc ccc gcc ccg caa ttc gag gtg atg tat ctg ttg gcg gta tgt gat get gca gcg 
gin arg pro ala pro gin phe glu val met tyr leu leu ala val cys asp ala ala ala 
asn ala pro pro arg asn ser arg OPA cys ile cys trp arg tyr val met leu gin arg 
thr pro arg pro ala ile arg gly asp val ser val gly gly met OPA cys cys ser gly 
841/281 871/291 

gcg gac cag ttg egg gat cga etc aag gaa ttg ggt gag teg gtg gcc ate gcc get get 
ala asp gin leu arg asp arg leu lys glu leu gly glu ser val ala ile ala ala ala 
arg thr ser cys gly ile asp ser arg asn trp val ser arg trp pro ser pro leu leu 
gly pro val ala gly ser thr gin gly ile gly OPA val gly gly his arg arg cys ser 
901/301 931/311 

ccg ccc gac age tac tec gta cac gtc cac acc gac gac gcc ggt gcc gcc gtg gaa gcc 
pro pro asp ser tyr ser val his val his thr asp asp ala gly ala ala val glu ala 
arg pro thr ala thr pro tyr thr ser thr pro thr thr pro val pro pro trp lys pro 
ala arg gin leu leu arg thr arg pro his arg arg arg arg cys arg arg gly ser arg 
961/321 991/331 

gga ttg gcg gtg ggg cga gtt age egg ate gtg ate teg gcg etc ggt tec ggg acc age 
gly leu ala val gly arg val ser arg ile val ile ser ala leu gly ser gly thr ser 
asp trp arg trp gly glu leu ala gly ser OPA ser arg arg ser val pro gly pro ala 
ile gly gly gly ala ser AMB pro asp arg asp leu gly ala arg phe arg asp gin arg 
1021/341 1051/351 

gga ttg ccg gcc ggt ggc tgg acg egg ggc cgc gcc gtg ctg gcg gtc gtc gac ggc gac 
gly leu pro ala gly gly trp thr arg gly arg ala val leu ala val val asp gly asp 
asp cys arg pro val ala gly arg gly ala ala pro cys trp arg ser ser thr ala thr 
ile ala gly arg trp leu asp ala gly pro arg arg ala gly gly arg arg arg arg arg 
1081/361 1111/371 

ggt gcc gcc gag ctg ttc gcc ggg gag ggc gcc tgc gtg ctg cga ccg ggt cca gac gcc 
gly ala ala glu leu phe ala gly glu gly ala cys val leu arg pro gly pro asp ala 
val pro pro ser cys ser pro gly arg ala pro ala cys cys asp arg val gin thr pro 
cys arg arg ala val arg arg gly gly arg leu arg ala ala thr gly ser arg arg arg 
1141/381 1171/391 

gtg aca ccg gcc gcc gat ate agt gcc cac cag ctg gtg egg gcc gtg gta gac acc ggc 
val thr pro ala ala asp ile ser ala his gin leu val arg ala val val asp thr gly 
OPA his arg pro pro ile ser val pro thr ser trp cys gly pro trp AMB thr pro ala 
asp thr gly arg arg tyr gin cys pro pro ala gly ala gly arg gly arg his arg arg 
1201/401 1231/411 

gcc gcg cac gtg atg gtg ctg ccc aat ggc tat gtg gcc gcc gaa gaa ctg gtg gcc ggg 
ala ala his val met val leu pro asn gly tyr val ala ala glu glu leu val ala gly 
pro arg thr OPA trp cys cys pro met ala met trp pro pro lys asn trp trp pro gly 
arg ala arg asp gly ala ala gin trp leu cys gly arg arg arg thr gly gly arg val 
1261/421 1291/431 

tgt acc gcg gcg ate ggc tgg ggc gtc gac gtg gta ccc gtg ccg acc gga teg atg gtg 
cys thr ala ala ile gly trp gly val asp val val pro val pro thr gly ser met val 
val pro arg arg ser ala gly ala ser thr trp tyr pro cys arg pro asp arg trp cys 
tyr arg gly asp arg leu gly arg arg arg gly thr arg ala asp arg ile asp gly ala 
1321/441 1351/451 

cag ggg ttg gcc gcg ctg gcc gtg cat gac gcg gcc cgc cag gcc gtc gac gac ggc tac 
gin gly leu ala ala leu ala val his asp ala ala arg gin ala val asp asp gly tyr 
arg gly trp pro arg trp pro cys met thr arg pro ala arg pro ser thr thr ala thr 
gly val gly arg ala gly arg ala OPA arg gly pro pro gly arg arg arg arg leu gin 
1381/461 1411/471 

age atg gcc cgt gcc gcc ggt get tec egg cac gga teg gtg cgc att gcc acc caa aag 
ser met ala arg ala ala gly ala ser arg his gly ser val arg ile ala thr gin lys 
ala trp pro val pro pro val leu pro gly thr asp arg cys ala leu pro pro lys arg 
his gly pro cys arg arg cys phe pro ala arg ile gly ala his cys his pro lys gly 



SEQ ID No. 4 IT (continued 1) 
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1441/481 1471/491 

gcg ctg acc tgg gcc ggt acc tgc aag ccg ggc gac ggt ctg ggt ate gcg ggc gac gag 
ala leu thr trp ala gly thr cys lys pro gly asp gly leu gly ile ala gly asp glu 
arg OPA pro gly pro val pro ala ser arg ala thr val trp val ser arg ala thr arg 
ala asp leu gly arg tyr leu gin ala gly arg arg ser gly tyr arg gly arg arg gly 
1501/501 1531/511 

gtg ctg ate gtc gcc gac gat gtc gcc gcg gcg gcc ate ggt ctg gtc gac ctg ttg ttg 
val leu ile val ala asp asp val ala ala ala ala ile gly leu val asp leu leu leu 
cys OPA ser ser pro thr met ser pro arg arg pro ser val trp ser thr cys cys trp 
ala asp arg arg arg arg cys arg arg gly gly his arg ser gly arg pro val val gly 
1561/521 1591/531 

gca teg gga ggc gat ctg gtg acg gtg eta att ggc gcc ggc gta acc gaa gac gtg get 
ala ser gly gly asp leu val thr val leu ile gly ala gly val thr glu asp val ala 
his arg glu ala ile trp OPA arg cys OCH leu ala pro ala OCH pro lys thr trp leu 
ile gly arg arg ser gly asp gly ala asn trp arg arg arg asn arg arg arg gly cys 
1621/541 1651/551 

gtc gtc ctg gaa egg cat gtg cac gac cac cat cca ggc acc gag ctg gtc tec tac cgc 
val val leu glu arg his val his asp his his pro gly thr glu leu val ser tyr arg 
ser ser trp asn gly met cys thr thr thr ile gin ala pro ser trp ser pro thr ala 
arg pro gly thr ala cys ala arg pro pro ser arg his arg ala gly leu leu pro his 
1681/561 1711/571 
acc gga cac cgc ggc gac gcg ctg ctg ate ggg gtc gag tag 
thr gly his arg gly asp ala leu leu ile gly val glu AMB 
pro asp thr ala ala thr arg cys OPA ser gly ser ser 
arg thr pro arg arg arg ala ala asp arg gly arg val 



SEQ ID No. 4 IT (continued 2) 
FIGURE 4 IT (continued 2) 



1/1 

GCC GGT AAC GCC GCG 
ala gly asn ala ala 
61/21 

GGC GCC CCG GTC GGC 
gly ala pro val gly 
121/41 

TAT CGA GGT GGA AGC 
tyr arg gly gly ser 
181/61 

TGC TCA TCG GAA CCC 
cys ser ser glu pro 
241/81 

GCT CTT GCA GTG ACC 
ala leu ala val thr 
301/101 

CAA TGC AGT AAG TTT 
gin cys ser lys phe 
361/121 

AAC AAA CGT GGC AAT 
asn lys arg gly asn 



TCC CAG TGC TAT CCG 
ser gin cys tyr pro 

CGC GGC CGG GCT CGA 
arg gly arg ala arg 

GGA CGG TGT TGG GAT 
gly arg cys trp asp 

GCG ACG CAC ACA ATG 
ala thr his thr met 

TGA TGA TGA CAC TCA 
OPA OPA OPA his ser 

ACA CAA ACG GAC TTG 
thr gin thr asp leu 

GCC GGG CAG CCT CTG 
ala gly gin pro leu 



31/11 








TCC GCC 


GGA 


CCG 


CCC 


ser ala 


gly 


pro 


pro 


91/31 








CCC GCT 


CCA 


CCT 


GGC 


pro ala 


pro 


pro 


gly 


151/51 








GCA CGC 


CCA 


ACT 


TGC 


ala arg 


pro 


thr 


cys 


211/71 








CCC GCA 


GCA 


CCG 


CAC 


pro ala 


ala 


pro 


his 


271/91 








CCC CCA 


TAA 


GGC 


TCG 


pro pro 


OCH 


gly 


ser 


331/111 








TAA AAA 


CCT 


GCG 


GAG 


OCH lys 


pro 


ala 


glu 


391/131 








CCC TTG 


TCG 


GAT 


C 


pro leu 


ser 


asp 





GAA ACA TCA GCG GCG 
glu thr ser ala ala 

CAT CAG CGA CCA GGT 
his gin arg pro gly 

CGG CGA TCG CGG CGA 
arg arg ser arg arg 

GAC GGC GCC CCA CCG 
asp gly ala pro pro 

TCG GCT GCG CCT GAG 
ser ala ala pro glu 

GTG GGG TCT ATG GCC 
val gly ser met ala 



SEQ ID No.42A 
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1/1 

CCG GTA ACG CCG CGT CCC AGT 
pro val thr pro arg pro ser 
61/21 

GCG CCC CGG TCG GCC GCG GCC 
ala pro arg ser ala ala ala 
121/41 

ATC GAG GTG GAA GCG GAC GGT 
ile glu val glu ala asp gly 
181/61 

GCT CAT CGG AAC CCG CGA CGC 
ala his arg asn pro arg arg 
241/81 

CTC TTG CAG TGA CCT GAT GAT 
leu leu gin OPA pro asp asp 
301/101 

AAT GCA GTA AGT TTA CAC AAA 
asn ala val ser leu his lys 
361/121 

ACA AAC GTG GCA ATG CCG GGC 
thr asn val ala met pro gly 









Jl/ XX 




I 






CCG CCG 


GAC 


ala 


J_ -L tr 


arg 


LJ 1- \~/ JL 


asp 












ccc 


CTC 


GAC 


CCG CTC 


CAC 


y xy 


leu 


asp 


pro leu 


his 








151/51 




GTT 


GGG 


ATG 


CAC GCC 


CAA 


val 


alv 


met 


his ala 


gin 








211/71 




ACA 


CAA 


TGC 


CCG CAG 


CAC 


thr 


gin 


cys 


pro gin 


his 








271/91 




GAC 


ACT 


CAC 


CCC CAT 


AAG 


asp 


thr 


his 


pro his 


lys 








331/111 




CGG 


ACT 


TGT 


AAA AAC 


CTG 


arg 


thr 


cys 


lys asn 


leu 








391/131 




AGC 


CTC 


TGC 


CCT TGT 


CGG 


ser 


leu 


cys 


pro cys 


arg 



SEQ ID No.42B 
FIGURE 42B 



CGC CCG AAA CAT CAG CGG CGG 
arg pro lys his gin arg arg 

CTG GCC ATC AGC GAC CAG GTT 
leu ala ile ser asp gin val 

CTT GCC GGC GAT CGC GGC GAT 
leu ala gly asp arg gly asp 

CGC ACG ACG GCG CCC CAC CGG 
arg thr thr ala pro his arg 

GCT CGT CGG CTG CGC CTG AGC 
ala arg arg leu arg leu ser 

CGG AGG TGG GGT CTA TGG CCA 
arg arg trp gly leu trp pro 

ATC 
ile 



1/1 

CGG TAA CGC CGC GTC 
arg OCH arg arg val 
61/21 

CGC CCC GGT CGG CCG 
arg pro gly arg pro 
121/41 

TCG AGG TGG AAG CGG 
ser arg trp lys arg 
181/61 

CTC ATC GGA ACC CGC 
leu ile gly thr arg 
241/81 

TCT TGC AGT GAC CTG 
ser cys ser asp leu 
301/101 

ATG CAG TAA GTT TAC 
met gin OCH val tyr 
361/121 

CAA ACG TGG CAA TGC 
gin thr trp gin cys 



CCA GTG CTA TCC GTC 
pro val leu ser val 

CGG CCG GGC TCG ACC 
arg pro gly ser thr 

ACG GTG TTG GGA TGC 
thr val leu gly cys 

GAC GCA CAC AAT GCC 
asp ala his asn ala 

ATG ATG ACA CTC ACC 
met met thr leu thr 

ACA AAC GGA CTT GTA 
thr asn gly leu val 

CGG GCA GCC TCT GCC 
arg ala ala ser ala 



31/11 

CGC CGG ACC GCC CGA 
arg arg thr ala arg 
91/31 

CGC TCC ACC TGG CCA 
arg ser thr trp pro 
151/51 

ACG CCC AAC TTG CCG 
thr pro asn leu pro 
211/71 

CGC AGC ACC GCA CGA 
arg ser thr ala arg 
271/91 

CCC ATA AGG CTC GTC 
pro ile arg leu val 
331/111 

AAA ACC TGC GGA GGT 
lys thr cys gly gly 
391/131 

CTT GTC GGA TC 
leu val gly 



AAC ATC AGC GGC GGG 
asn ile ser gly gly 

TCA GCG ACC AGG TTA 
ser ala thr arg leu 

GCG ATC GCG GCG ATG 
ala ile ala ala met 

CGG CGC CCC ACC GGC 
arg arg pro thr gly 

GGC TGC GCC TGA GCA 
gly cys ala OPA ala 

GGG GTC TAT GGC CAA 
gly val tyr gly gin 



SEQ ID No.42C 
FIGURE 42C 
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Coding sequence Rv2622 predicted by Cole et al. , 1998 (Nature 393:537-544) and 
containing seq42A: 

1/1 31/11 

atg gcc aac aaa cgt ggc aat gcc ggg cag cct ctg ccc ttg teg gat cga gac gac gac 
Met ala asn lys arg gly asn ala gly gin pro leu pro leu ser asp arg asp asp asp 
61/21 91/31 

cac atg cag ggg cac tgg ctg ctg gcc egg ctg ggc aag egg gtg ctg cgt ccc ggc ggc 
his met gin gly his trp leu leu ala arg leu gly lys arg val leu arg pro gly gly 
121/41 151/51 

gtc gaa etc ace egg aca ctg ctg gcc cgc gcc gag gtg ace gac gcc gac gtg etc gag 
val glu leu thr arg thr leu leu ala arg ala glu val thr asp ala asp val leu glu 
181/61 * 211/71 

ctg gca ccg ggc ctg ggc cgc acc gca gcc gaa ate ttg gcc cgc aac ccg egg teg tac 
leu ala pro gly leu gly arg thr ala ala glu ile leu ala arg asn pro arg ser tyr 
241/81 271/91 

gtg ggg gcg gag age gat ccc aac gcg gcc aac ctg gtc cga cac gtt etc gcc ggc cgc 
val gly ala glu ser asp pro asn ala ala asn leu val arg his val leu ala gly arg 
301/101 331/111 

ggc gac gtc egg gtc acc gac gcg gcc gat acc gga tta tec gac gcc age gcc gat gtc 
gly asp val arg val thr asp ala ala asp thr gly leu ser asp ala ser ala asp val 
361/121 391/131 

gtc ate ggc gag gcg atg ctg acc atg caa ggc aac gcg get aaa cac acg ate gtc gcc 
val ile gly glu ala met leu thr met gin gly asn ala ala lys his thr ile val ala 
421/141 451/151 

gag gcg gcg egg gtg ctg agg ccg ggt ggc cgc tac gcg att cac gaa eta gcg ctg gtg 
glu ala ala arg val leu arg pro gly gly arg tyr ala ile his glu leu ala leu val 
481/161 511/171 

ccg gac gac gtc gca gag cag gtc cgc acc gac ctg egg cag teg ctg gcc cgc gcg etc 
pro asp asp val ala glu gin val arg thr asp leu arg gin ser leu ala arg ala leu 
541/181 571/191 

aag gtc aat gcg cgt ccg ctg acc gtt gcg gaa tgg teg cac etc tta gcg ggc cat gga 
lys val asn ala arg pro leu thr val ala glu trp ser his leu leu ala gly his gly 
601/201 631/211 

ctg gtc gtc gaa cac gtt gtc acc get tec atg gcg ttg tta caa ccg cga egg gtg ate 
leu val val glu his val val thr ala ser met ala leu leu gin pro arg arg val ile 
661/221 691/231 

get gac gaa ggc etc ctg ggt gcg ctg egg ttc gcc gga aac ctg etc ate cat cgt gcc 
ala asp glu gly leu leu gly ala leu arg phe ala gly asn leu leu ile his arg ala 
721/241 751/251 

gcg cgt egg cga gtc ctg ttg atg cgc cac aca ttc cgc agg cat cgt gaa cgc ttg aca 

ala arg arg arg val leu leu met arg his thr phe arg arg his arg glu arg leu thr 

781/261 811/271 

gcc gtc gcc att gtc gcg cac aaa ccg cac gtc gat teg tga 

ala val ala ile val ala his lys pro his val asp ser OPA 



SEQ ID No.42D 
FIGURE 42D 
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ORF according to Cole et al . , 1998 (Natu 
1/1 

taa aaa cct gcg gag gtg ggg tct atg gcc 
OCH lys pro ala glu val gly ser met ala 
61/21 

ccc ttg teg gat cga gac gac gac cac atg 
pro leu ser asp arg asp asp asp his met 
121/41 

aag egg gtg ctg cgt ccc ggc ggc gtc gaa 
lys arg val leu arg pro gly gly val glu 
181/61 

gtg acc gac gcc gac gtg etc gag ctg gca 
val thr asp ala asp val leu glu leu ala 
241/81 

ttg gcc cgc aac ccg egg teg tac gtg ggg 
leu ala arg asn pro arg ser tyr val gly 
301/101 

gtc cga cac gtt etc gcc ggc cgc ggc gac 
val arg his val leu ala gly arg gly asp 
361/121 

tta tec gac gcc age gcc gat gtc gtc ate 
leu ser asp ala ser ala asp val val ile 
421/141 

gcg get aaa cac acg ate gtc gcc gag gcg 
ala ala lys his thr ile val ala glu ala 
481/161 

gcg att cac gaa eta gcg ctg gtg ccg gac 
ala ile his glu leu ala leu val pro asp 
541/181 

egg cag teg ctg gcc cgc gcg etc aag gtc 
arg gin ser leu ala arg ala leu lys val 
601/201 

teg cac etc tta gcg ggc cat gga ctg gtc 
ser his leu leu ala gly his gly leu val 
661/221 

ttg tta caa ccg cga egg gtg ate get gac 
leu leu gin pro arg arg val ile ala asp 
721/241 

gga aac ctg etc ate cat cgt gcc gcg cgt 
gly asn leu leu ile his arg ala ala arg 
781/261 

cgc agg cat cgt gaa cgc ttg aca gcc gtc 
arg arg his arg glu arg leu thr ala val 
841/281 
teg tga 
ser OPA 



re 393:537-544) and containing Rv2622 
31/11 

aac aaa cgt ggc aat gcc ggg cag cct ctg 
asn lys arg gly asn ala gly gin pro leu 
91/31 

cag ggg cac tgg ctg ctg gcc" egg ctg ggc 
gin gly his trp leu leu ala arg leu gly 
151/51 

etc acc egg aca ctg ctg gcc cgc gcc gag 
leu thr arg thr leu leu ala arg ala glu 
211/71 

ccg ggc ctg ggc cgc acc gca gcc gaa ate 
pro gly leu gly arg thr ala ala glu ile 
271/91 

gcg gag age gat ccc aac gcg gcc aac ctg 
ala glu ser asp pro asn ala ala asn leu 
331/111 

gtc egg gtc acc gac gcg gcc gat acc gga 
val arg val thr asp ala ala asp thr gly 
391/131 

ggc gag gcg atg ctg acc atg caa ggc aac 
gly glu ala met leu thr met gin gly asn 
451/151 

gcg egg gtg ctg agg ccg ggt ggc cgc tac 
ala arg val leu arg pro gly gly arg tyr 
511/171 

gac gtc gca gag cag gtc cgc acc gac ctg 
asp val ala glu gin val arg thr asp leu 
571/191 

aat gcg cgt ccg ctg acc gtt gcg gaa tgg 
asn ala arg pro leu thr val ala glu trp 
631/211 

gtc gaa cac gtt gtc acc get tec atg gcg 
val glu his val val thr ala ser met ala 
691/231 

gaa ggc etc ctg ggt gcg ctg egg tte gcc 
glu gly leu leu gly ala leu arg phe ala 
751/251 

egg cga gtc ctg ttg atg cgc cac aca tte 
arg arg val leu leu met arg his thr phe 
811/271 

gcc att gtc gcg cac aaa ccg cac gtc gat 
ala ile val ala his lys pro his val asp 



SEQ ID No.42F 
FIGURE 42F 



# 
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1/1 

ate gcg cgt gac ate gat gac 
ile ala arg asp ile asp asp 
61/21 

gtt gtt tea gcg ggc gac gtg 
val val ser ala gly asp val 
121/41 

aag att aag gtc gcg ggc atg 
lys ile lys val ala gly met 
181/61 

gtt ctg cac cgc cat ccg cac 
val leu his arg his pro his 
241/81 

ctg acc ggg ttg gcg gcg ttc 
leu thr gly leu ala ala phe 



31/11 



cag 


ggt 


egg 


ctg tgt 


ctg 


gin 


gly 


arg 


leu cys 


leu 








91/31 




gtg 


cat 


ttg 


cgt taa 


etc 


val 


his 


leu 


arg OCH 


leu 








151/51 




age 


tat 


ccg 


gag aat 


gtc 


ser 


tyr 


pro 


glu asn 


val 








211/71 




tgg 


aat 


cgc 


tta ate 


tgg 


trp 


asn 


arg 


leu ile 


trp 








271/91 




ggg 


tec 


gga 


ttc gtc 


aac 


gly 


ser 


gly 


phe val 


asn 



SEQ ID No.43A 
FIGURE 43A 



gac gtc ggc ggt cga acg gta 
asp val gly gly arg thr val 

gcg egg age tgg cgt ccc caa 
ala arg ser trp arg pro gin 

ctg gee get ggc gag cag gtc 
leu ala ala gly glu gin val 

ccc gtc gtg gtg ctg gtc ttg 
pro val val val leu val leu 

teg aca cct tgg cag cag ate 
ser thr pro trp gin gin ile 



1/1 

teg cgc gtg aca teg atg acc 
ser arg val thr ser met thr 
61/21 

ttg ttt cag egg gcg acg tgg 
leu phe gin arg ala thr trp 
121/41 

aga tta agg teg egg gca tga 
arg leu arg ser arg ala OPA 
181/61 

ttc tgc acc gee ate cgc act 
phe cys thr ala ile arg thr 
241/81 

tga ccg ggt tgg egg cgt teg 
OPA pro gly trp arg arg ser 









31/11 




agg 


gtc 


ggc 


tgt gtc 


tgg 


arg 


val 


gly 


cys val 


trp 








91/31 




tgc 


att 


tgc 


gtt aac 


teg 


cys 


ile 


cys 


val asn 


ser 








151/51 




get 


ate 


egg 


aga atg 


tec 


ala 


ile 


arg 


arg met 


ser 








211/71 




gga 


ate 


get 


taa tct 


ggc 


gly 


ile 


ala 


OCH ser 


gly 








271/91 




ggt 


ccg 


gat 


teg tea 


act 


gly 


pro 


asp 


ser ser 


thr 



SEQ ID No.43B 
FIGURE 43B 



acg teg gcg gtc gaa egg tag 
thr ser ala val glu arg AMB 

cgc gga get ggc gtc ccc aaa 
arg gly ala gly val pro lys 

tgg ccg ctg gcg age agg teg 
trp pro leu ala ser arg ser 

ccg teg tgg tgc tgg tct tgc 
pro ser trp cys trp ser cys 

cga cac ctt ggc age aga tc 
arg his leu gly ser arg 



141/185 

1/1 31/11 

cgc gcg tga cat cga tga cca ggg teg get gtg tct gga cgt egg egg teg aac ggt agt 
arg ala OPA his arg OPA pro gly ser ala val ser gly arg arg arg ser asn gly ser 
61/21 91/31 

tgt ttc age ggg cga cgt ggt gca ttt gcg tta act cgc gcg gag ctg gcg tec cca aaa 
cys phe ser gly arg arg gly ala phe ala leu thr arg ala glu leu ala ser pro lys 
121/41 151/51 

gat taa ggt cgc ggg cat gag eta tec gga gaa tgt cct ggc cgc tgg cga gca ggt cgt 
asp OCH gly arg gly his glu leu ser gly glu cys pro gly arg trp arg ala gly arg 
181/61 211/71 

tct gca ccg cca tec gca ctg gaa teg ctt aat ctg gee cgt cgt ggt get ggt ctt get 
ser ala pro pro ser ala leu glu ser leu asn leu ala arg arg gly ala gly leu ala 
241/81 271/91 

gac egg gtt ggc ggc gtt egg gtc egg att cgt caa etc gac acc ttg gca gca gat c 
asp arg val gly gly val arg val arg ile arg gin leu asp thr leu ala ala asp 

SEQ ID No.43C 
FIGURE 43C 



Coding sequence Rv3278c predicted by Cole et al., 1998 (Nature 393:537-544) and 
containing seq43A: 

1/1 31/11 

atg age tat ccg gag aat gtc ctg gee get ggc gag cag gtc gtt ctg cac cgc cat ccg 
Met ser tyr pro glu asn val leu ala ala gly glu gin val val leu his arg his pro 
61/21 91/31 

cac tgg aat cgc tta ate tgg ccc gtc gtg gtg ctg gtc ttg ctg acc ggg ttg gcg gcg 
his trp asn arg leu ile trp pro val val val leu val leu leu thr gly leu ala ala 
121/41 151/51 

ttc ggg tec gga ttc gtc aac teg aca cct tgg cag cag ate get aag aac gtg att cac 
phe gly ser gly phe val asn ser thr pro trp gin gin ile ala lys asn val ile his 
181/61 211/71 

gcg gtc ate tgg ggg ate tgg ttg gtg ate gtc ggc tgg etc acg ctg tgg cca ttc ctg 
ala val ile trp gly ile trp leu val ile val gly trp leu thr leu trp pro phe leu 
241/81 271/91 

age tgg ctg acc aca cat ttc gtg gtg acc aac egg egg gtg atg ttc egg cat ggt gtg 
ser trp leu thr thr his phe val val thr asn arg arg val met phe arg his gly val 
301/101 331/111 

ctg acc cgc age ggg ate gac ata ccg eta gca egg ate aac age gtg gag ttc egg gac 
leu thr arg ser gly ile asp ile pro leu ala arg ile asn ser val glu phe arg asp 
361/121 391/131 

egg ate ttc gag egg att ttt cgc acc ggg acg ttg att ate gag tec gcg tea caa gat 
arg ile phe glu arg ile phe arg thr gly thr leu ile ile glu ser ala ser gin asp 
421/141 451/151 

ccg etc gag ttc tac aac att ccg cgc ctg egg gag gtg cat gcg ttg ctg tat cac gag 
pro leu glu phe tyr asn ile pro arg leu arg glu val his ala leu leu tyr his glu 
481/161 511/171 
gtt ttc gac acc ctg ggc tec gac gag teg ccc age tga 
val phe asp thr leu gly ser asp glu ser pro ser OPA 



SEQ ID No. 4 3D 
FIGURE 43D 



* 



142/185 



ORF according to Cole et al . , 1998 (Nature 393:537-544) and containing Rv3278c 
1/1 31/11 

taa etc gcg egg age tgg cgt ccc caa aag att aag gtc gcg ggc atg age tat ccg gag 
OCH leu ala arg ser trp arg pro gin lys ile lys val ala gly met ser tyr pro glu 
61/21 91/31 

aat gtc ctg gee get ggc gag cag gtc gtt ctg cac cgc cat ccg cac tgg aat cgc tta 
asn val leu ala ala gly glu gin val val leu his arg his pro his trp asn arg leu 
121/41 , 151/51 

ate tgg ccc gtc gtg gtg ctg gtc ttg ctg ace ggg ttg gcg gcg ttc ggg tec gga ttc 
ile trp pro val val val leu val leu leu thr gly leu ala ala phe gly ser gly phe 
181/61 211/71 

gtc aac teg aca cct tgg cag cag ate get aag aac gtg att cac gcg gtc ate tgg ggg 
val asn ser thr pro trp gin gin ile ala lys asn val ile his ala val ile trp gly 
241/81 271/91 

ate tgg ttg gtg ate gtc ggc tgg etc acg ctg tgg cca ttc ctg age tgg ctg ace aca 
ile trp leu val ile val gly trp leu thr leu trp pro phe leu ser trp leu thr thr 
301/101 331/111 

cat ttc gtg gtg ace aac egg egg gtg atg ttc egg cat ggt gtg ctg ace cgc age ggg 
his phe val val thr asn. arg arg val met phe arg his gly val leu thr arg ser gly 
361/121 391/131 

ate gac ata ccg eta gca egg ate aac age gtg gag ttc egg gac egg ate ttc gag egg 
ile asp ile pro leu ala arg ile asn ser val glu phe. arg asp arg ile phe glu arg 
421/141 451/151 

att ttt cgc acc ggg acg ttg att ate gag tec gcg tea caa gat ccg etc gag ttc tac 
ile phe arg thr gly thr leu ile ile glu ser ala ser gin asp pro leu glu phe tyr 
481/161 511/171 

aac att ccg cgc ctg egg gag gtg cat gcg ttg ctg tat cac gag gtt ttc gac acc ctg 
asn ile pro arg leu arg glu val his ala leu leu tyr his glu val phe asp thr leu 
541/181 

ggc tec gac gag teg ccc age tga 
gly ser asp glu ser pro ser OPA 



SEQ ID No.43F 
FIGURE 43F 



# 



143/185 



1/1 

gcc aag atg gat 
ala lys met asp 
61/21 

acc acc cat ate 
thr thr his ile 
121/41 

ccg atg ggg gtt 
pro met gly val 
181/61 

ttg ccg tat acc 
leu pro tyr thr 
241/81 

ttt aac tec atg 
phe asn ser met 
301/101 

aac ctg caa ate 
asn leu gin ile 
361/121 

cca ggc aaa ggc 
pro gly lys gly 
421/141 
gtg gcg ate 
val ala ile 



gtc tac caa cgc 
val tyr gin arg 

ggt teg gcg ggc 
gly ser ala gly 

tac age ctg gac 
tyr ser leu asp 

caa gtc gga ccc 
gin val gly pro 

cag gtc tgt cag 
gin val cys gin 

ccg cag tac aag 
pro gin tyr lys 

tec gcg ttc ttc 
ser ala phe phe 



31/11 
acc gcc gcc ggc 
thr ala ala gly 

91/31 
atg gcg ccg gaa 
met ala pro glu 

151/51 
tec get ttt ggc 
ser ala phe gly 

211/71 
aat cac tgg tgg 
asn his trp trp 

271/91 
aag tec cag tgc 
lys ser gin cys 

331/111 
cat teg gtc gtg 
his ser val val 

391/131 
ttt cac acc acc 
phe his thr thr 



SEQ ID N0.44A 
FIGURE 44A 



tgg cag ccg etc 
trp gin pro leu 

gcc aag age gga 
ala lys ser gly 

acc gcg ccg aat 
thr ala pro asn 

agt ggc gac gac 
ser gly asp asp 

ccg ttc age acg 
pro phe ser thr 

atg ggc gtc aac 
met gly val asn 

gac ggc ggg ccc 
asp gly gly pro 



aag acc ggt ate 
lys thr gly ile 

tat ccg gcc act 
tyr pro ala thr 

ccc ggt ggc ggg 
pro gly gly gly 

aat age ccc acc 
asn ser pro thr 

gcc gac age gag 
ala asp ser glu 

aag gcc aag gtc 
lys ala lys val 

acc gcg ggt tgt 
thr ala gly cys 



1/1 

cca aga tgg atg tct acc aac 
pro arg trp met ser thr asn 
61/21 

cca ccc ata teg gtt egg egg 
pro pro ile ser val arg arg 
121/41 

cga tgg ggg ttt aca gcc tgg 
arg trp gly phe thr ala trp 
181/61 

tgc cgt ata ccc aag teg gac 
cys arg ile pro lys ser asp 
241/81 

tta act cca tgc agg tct gtc 
leu thr pro cys arg ser val 
301/101 

acc tgc aaa tec cgc agt aca 
thr cys lys ser arg ser thr 
361/121 

cag gca aag get ccg cgt tct 
gin ala lys ala pro arg ser 
421/141 
tgg cga tc 
trp arg 



31/11 

gca ccg ccg ccg get ggc 
ala pro pro pro ala gly 
91/31 

gca tgg cgc egg aag cca 
ala trp arg arg lys pro 
151/51 

act ccg ctt ttg gca ccg 
thr pro leu leu ala pro 

211/71 
cca ate act ggt gga gtg 
pro ile thr gly gly val 

271/91 
aga agt ccc agt gcc cgt 
arg ser pro ser ala arg 

331/111 
age att egg teg tga tgg 
ser ile arg ser OPA trp 

391/131 
tct ttc aca cca ccg acg 
ser phe thr pro pro thr 



SEQ ID No,44B 
FIGURE 44B 



age cgc tea aga ccg gta tea 
ser arg ser arg pro val ser 

aga gcg gat ate egg cca etc 
arg ala asp ile arg pro leu 

cgc cga ate ccg gtg gcg ggt 
arg arg ile pro val ala gly 

gcg acg aca ata gcc cca cct 
ala thr thr ile ala pro pro 

tea gca egg ccg aca gcg aga 
ser ala arg pro thr ala arg 

gcg tea aca agg cca agg tec 
ala ser thr arg pro arg ser 

gcg ggc cca ccg egg gtt gtg 
ala gly pro pro arg val val 



# 



144/185 



1/1 

caa gat gga tgt eta cca acg 
gin asp gly cys leu pro thr 
61/21 

cac cca tat egg ttc ggc ggg 
his pro tyr arg phe gly gly 
121/41 

gat ggg ggt tta cag cct gga 
asp gly gly leu gin pro gly 
181/61 

gec gta tac cca agt egg acc 
ala val tyr pro ser arg thr 
241/81 

taa etc cat gca ggt ctg tea 
OCH leu his ala gly leu ser 
301/101 

cct gca aat ccc gca gta caa 
pro ala asn pro ala val gin 
361/121 

agg caa agg etc cgc gtt ctt 
arg gin arg leu arg val leu 
421/141 
ggc gat c 
gly asp 



31/11 

cac cgc cgc egg ctg gca 
his arg arg arg leu ala 
91/31 

cat ggc gee gga age caa 
his gly ala gly ser gin 
151/51 

etc cgc ttt tgg cac cgc 
leu arg phe trp his arg 

211/71 
caa tea ctg gtg gag tgg 
gin ser leu val glu trp 

271/91 
gaa gtc cca gtg ccc gtt 
glu val pro val pro val 

331/111 
gca ttc ggt cgt gat ggg 
ala phe gly arg asp gly 

391/131 
ctt tea cac cac cga egg 
leu ser his his arg arg 



SEQ ID No.44C 
FIGURE 44C 



gec get caa gac egg tat cac 
ala ala gin asp arg tyr his 

gag egg ata tec ggc cac tec 
glu arg ile ser gly his ser 

gec gaa tec egg tgg egg gtt 
ala glu ser arg trp arg val 

cga cga caa tag ccc cac ctt 
arg arg gin AMB pro his leu 

cag cac ggc cga cag cga gaa 
gin his gly arg gin arg glu 

cgt caa caa ggc caa ggt ccc 
arg gin gin gly gin gly pro 

egg gec cac cgc ggg^ ttg tgt 
arg ala his arg gly leu cys 



145/185 



Coding sequence Rv0309 predicted by Cole et al . , 1998 (Nature 393:537-544) and 
containing Seq44A: 

1/1 31/11 

atg age cga etc eta get ttg ctg tgc get gcg gta tgc acg ggc tgc gtt get gtg gtt 
Met ser arg leu leu ala leu leu cys ala ala val cys thr gly cys val ala val val 
61/21 91/31 

etc gcg cca gtg age ctg gee gtc gtc aac ceg tgg ttc gcg aac teg gtc ggc aat gee 
leu ala pro val ser leu ala val val asn pro trp phe ala asn ser val gly asn ala 
121/41 151/51 

act cag gtg gtt teg gtg gtg gga acc ggc ggt teg acg gec aag atg gat gtc tac caa 
thr gin val val ser val val gly thr gly gly ser thr ala lys met asp val tyr gin 
181/61 211/71 

cgc acc gec gec ggc tgg cag ceg etc aag acc ggt ate acc acc cat ate ggt teg gcg 
arg thr ala ala gly trp gin pro leu lys thr gly ile thr thr his ile gly ser ala 
241/81 271/91 

ggc atg gcg ceg gaa gee aag age gga tat ceg gee act ceg atg ggg gtt tac age ctg 
gly met ala pro glu ala lys ser gly tyr pro ala thr pro met gly val tyr ser leu 
301/101 331/111 

gac tec get ttt ggc acc gcg ceg aat ccc ggt ggc ggg ttg ceg tat acc caa gtc gga 
asp ser ala phe gly thr ala pro asn pro gly gly gly leu pro tyr thr gin val gly 
361/121 391/131 

ccc aat cac tgg -tgg agt ggc gac gac aat age ccc acc ttt aac tec atg cag gtc tgt 
pro asn his trp trp ser gly asp asp asn ser pro thr phe asn ser met gin val cys 
421/141 451/151 

cag aag tec cag tgc ceg ttc age acg gee gac age gag aac ctg caa ate ceg cag tac 
gin lys ser gin cys pro phe ser thr ala asp ser glu asn leu gin ile pro gin tyr 
481/161 511/171 

aag cat teg gtc gtg atg ggc gtc aac aag gee aag gtc cca ggc aaa ggc tec gcg ttc 
lys his ser val val met gly val asn lys ala lys val pro gly lys gly ser ala phe 
541/181 571/191 

ttc ttt cac acc acc gac ggc ggg ccc acc gcg ggt tgt gtg gcg ate gac gat gee acg 
phe phe his thr thr asp gly gly pro thr ala gly cys val ala ile asp asp ala thr 
601/201 631/211 

ctg gtg cag ate ate cgt tgg ctg egg cct ggt gcg gtg ate gcg ate gec aag taa 
leu val gin ile ile arg trp leu arg pro gly ala val ile ala ile ala lys OCH 



SEQ ID NO-44D 
FIGURE 44D 



# 



146/185 



ORF according to Cole et al . , 1998 (Nature 393:537-544) and containing Rv0309 
1/1 31/11 

tga gcg atg age cga etc eta get ttg ctg tgc get gcg gta tgc acg ggc tgc gtt get 
OPA ala met ser arg leu leu ala leu leu cys ala ala val cys thr gly cys val ala 
61/21 91/31 

gtg gtt etc gcg cca gtg age ctg gee gtc gtc aac ccg tgg ttc gcg aac teg gtc ggc 
val val leu ala pro val ser leu ala val val asn pro trp phe ala asn ser val gly 
121/41 151/51 

aat gee act cag gtg gtt teg gtg gtg gga ace ggc ggt teg acg gee aag atg gat gtc 
asn ala thr gin val val ser val val gly thr gly gly ser thr ala lys met asp val 
181/61 211/71 

tac caa cgc acc gee gee ggc tgg cag ccg etc aag ace ggt ate acc acc cat ate ggt 
tyr gin arg thr ala ala gly trp gin pro leu lys thr gly ile thr thr his ile gly 
241/81 271/91 

teg gcg ggc atg gcg ccg gaa gee aag age gga tat ccg gee act ccg atg ggg gtt tac 
ser ala gly met ala pro glu ala lys ser gly tyr pro ala thr pro met gly val tyr 
301/101 331/111 

age ctg gac tec get ttt ggc acc gcg ccg aat ccc ggt ggc ggg ttg ccg tat acc caa 
ser leu asp ser ala phe gly thr ala pro asn pro gly gly gly leu pro tyr thr gin 
361/121 391/131 

gtc gga ccc aat cac tgg tgg agt ggc gac gac aat age ccc acc ttt aac tec atg cag 
val gly pro asn his trp trp ser gly asp asp asn ser pro thr phe asn ser met gin 
421/141 451/151 

gtc tgt cag aag tec cag tgc ccg ttc age acg gee gac age gag aac ctg caa ate ccg 
val cys gin lys ser gin cys pro phe ser thr ala asp ser glu asn leu gin ile pro 
481/161 511/171 

cag tac aag cat teg gtc gtg atg ggc gtc aac aag gee aag gtc cca ggc aaa ggc tec 
gin tyr lys his ser val val met gly val asn lys ala lys val pro gly lys gly ser 
541/181 571/191 

gcg ttc ttc ttt cac acc acc gac ggc ggg ccc acc gcg ggt tgt gtg gcg ate gac gat 
ala phe phe phe his thr thr asp gly gly pro thr ala gly cys val ala ile asp asp 
601/201 631/211 

gee acg ctg gtg cag ate ate cgt tgg ctg egg cct ggt gcg gtg ate gcg ate gee aag 

ala thr leu val gin ile ile arg trp leu arg pro gly ala val ile ala ile ala lys 

661/221 

taa 

OCH 



SEQ ID No.44F 
FIGURE 44F 



# 



147/185 



Cloned fragment fused with phoA 
1/1 

gat etc ccc gga cac cag gtc ate egg cga 
asp leu pro gly his gin val ile arg arg 
61/21 

tec ggt age cag agg cac cag cat cag caa 
ser gly ser gin arg his gin his gin gin 
121/41 

ggt cct tgc cac teg cga tec ttg gga tga 
gly pro cys his ser arg ser leu gly OPA 
181/61 

ate gtg cca gac egg gca tgc cgc gtc ggc 
ile val pro asp arg ala cys arg val gly 
241/81 

gtg cga ccc agg atg gcg aat get egg ggg 
val arg pro arg met ala asn ala arg gly 
301/101 

teg gtg aag ccc aac egg egg tac aac cgc 
ser val lys pro asn arg arg tyr asn arg 
361/121 

ggt gtg gag age agg acg ttg tec teg teg 
gly val glu ser arg thr leu ser ser ser 
421/141 

tec ccg agg cca egg cct tga gcg egg gga 
ser pro arg pro arg pro OPA ala arg gly 
481/161 

ctg gtc ate agt egg gcg ate get agg cgc 
leu val ile ser arg ala ile ala arg arg 
541/181 

ace tgc tgt tgc cac cac tgg ccg ggc gec 
thr cys cys cys his his trp pro gly ala 
601/201 

ggc gcg ttg etc agt teg gcg gee gac ggc 
gly ala leu leu ser ser ala ala asp gly 
661/221 

teg get gee gtt acc teg acg gee gcg acc 
ser ala ala val thr ser thr ala ala thr 
721/241 

age cac att ggg gcg cgc aaa gtc teg gtg 
ser his ile gly ala arg lys val ser val 
781/261 

acc gtc agg gca tea ccg agg egg cgc tec 
thr val arg ala ser pro arg arg arg ser 
841/281 

ate gee aac gcg egg tgt cct cct cat gtg 
ile ala asn ala arg cys pro pro his val 
901/301 

egg aca age cga tga ggc cgc ccg cgc tgg 
arg thr ser arg OPA gly arg pro arg trp 



31/11 

gat ggt gat cga ggc teg gac ccg cag gca 
asp gly asp arg gly ser asp pro gin ala 
91/31 

cat cgc gat ggc cag cat gee gcg ccg teg 
his arg asp gly gin his ala ala pro ser 
151/51 

egg tgg ggc ata get age gcg cac cag gtc 
arg trp gly ile ala ser ala his gin val 
211/71 

aag ctg teg ggc gcg ggt tag age ggt age 
lys leu ser gly ala gly AMB ser gly ser 
271/91 

tea ccg gcg aag tgg tag ccg egg atg atg 
ser pro ala lys trp AMB pro arg met met 
331/111 

cac gee cga ttg tec tea ccg ttg gtc tec 
his ala arg leu ser ser pro leu val ser 
391/131 

cga ccg get age agt egg egg gee aac gee 
arg pro ala ser ser arg arg ala asn ala 
451/151 

agg atg tgc aat tea gtc aac teg aag tag 
arg met cys asn ser val asn ser lys AMB 
511/171 

gga aag ccg ctg cgt tgc aag ccc agt acc 
gly lys pro leu arg cys lys pro ser thr 
571/191 

ccg gga tag ccg tac gee act ccg age att 
pro gly AMB pro tyr ala thr pro ser ile 
631/211 

age gec gtg gtg teg gcg gee teg gee tgt 
ser ala val val ser ala ala ser ala cys 
691/231 

gee tgc cag ccg cgc cgc egg atg tgc tec 
ala cys gin pro arg arg arg met cys ser 
751/251 

ccc ctg ggg tag cgc ate gcg teg aca tac 
pro leu gly AMB arg ile ala ser thr tyr 
811/271 

ata teg ctg ggc ggc aga teg atg agg aat 
ile ser leu gly gly arg ser met arg asn 
871/291 

atg aac cga tgc gtg ctt gcg cac cag tat 
met asn arg cys val leu ala his gin tyr 
931/311 

acg ggg ctt gta gcg tat ggc cgt ttc cgc 
thr gly leu val ala tyr gly arg phe arg 



SEQ ID No , 45ZA 
FIGURE 45ZA 



148/185 



961/321 

tea get cgt cgc tgc ggc gee gec ggg ata 
ser ala arg arg cys gly ala ala gly ile 
1021/341 

gat tga cct cgt ate ate tga gtt agt tgc 
asp OPA pro arg ile ile OPA val ser cys 
1081/361 

att acg tga cag tct gtc ggc aag gag gga 
ile thr OPA gin ser val gly lys glu gly 
1141/381 

atg ctt gac cag ate gag age get etc tac 
met leu asp gin ile glu ser ala leu tyr 
1201/401 

cgt ggc ggg ggc ttc cgc gca ccg acc gcg 
arg gly gly gly phe arg ala pro thr ala 
1261/421 

ate ate ggt ctg ggg atg ttg gtt tec ggc 
ile ile gly leu gly met leu val ser gly 
1321/441 

ttc ccg ata etc age gtt ttc ggt ttt gtc 
phe pro ile leu ser val phe gly phe val 
1381/461 

acc ggt cct egg ttg tec ggc agg atg gat 
thr gly pro arg leu ser gly arg met asp 
1441/481 

cag cgt cgt acc aag ggg gee ggg ggc tea 
gin arg arg thr lys gly ala gly gly ser 



991/331 

gaa teg ccc gcg aac cag tgg tac ggc gca 
glu ser pro ala asn gin trp tyr gly ala 
1051/351 

ccg cgc aat ggg cat ccg cgt gtt ate ggt 
pro arg asn gly his pro arg val ile gly 
1111/371 

cgc atg cca etc tec gat cat gag cag egg 
arg met pro leu ser asp his glu gin arg 
1171/391 

gee gaa gat ccc aag ttc gca teg agt gtc 
ala glu asp pro lys phe ala ser ser val 
1231/411 

egg egg cgc ctg cag ggc gcg gcg ttg ttc 
arg arg arg leu gin gly ala ala leu phe 
1291/431 

gtg gcg ttc aaa gag acc atg ate gga agt 
val ala phe lys glu thr met ile gly ser 
1351/451 

gtg atg ttc ggt ggt gtg gtg tat gee ate 
val met phe gly gly val val tyr ala ile 
1411/471 

cgt ggc gga teg get get ggg get teg cgc 
arg gly gly ser ala ala gly ala ser arg 
1471/491 

ttc acc age cgt atg gaa gat c 
phe thr ser arg met glu asp 



SEQ ID No.45ZA (continued) 
FIGURE 45ZA (continued) 



149/185 

fragment seq45ZA shifted minus 1 for the. reading frame 
1/1 31/11 

ate tec ccg gac ace agg tea tec ggc gag atg gtg ate gag get egg ace cgc agg cat 
ile ser pro asp thr arg ser ser gly glu met val ile glu ala arg thr arg arg his 
61/21 91/31 

ccg gta gee aga ggc acc age ate age aac ate gcg atg gee age atg ccg cgc cgt egg 
pro val ala arg gly thr ser ile ser asn ile ala met ala ser met pro arg arg arg 
121/41 151/51 

gtc ctt gee act cgc gat cct tgg gat gac ggt ggg gca tag eta gcg cgc acc agg tea 
val leu ala thr arg asp pro trp asp asp gly gly ala AMB leu ala arg thr arg ser 
181/61 211/71 

teg tgc cag acc ggg cat gee gcg teg gca age tgt egg gcg egg gtt aga gcg gta gcg 
ser cys gin thr gly his ala ala ser ala ser cys arg ala arg val arg ala val ala 
241/81 271/91 

tgc gac cca gga tgg cga atg etc ggg ggt cac egg cga agt ggt age cgc gga tga tgt 
cys asp pro gly trp arg met leu gly gly his arg arg ser gly ser arg gly OPA cys 
301/101 331/111 

egg tga age cca acc ggc ggt aca acc gee acg ccc gat tgt cct cac cgt tgg tct ccg 
arg OPA ser pro thr gly gly thr thr ala thr pro asp cys pro his arg trp ser pro 
361/121 391/131 

gtg tgg aga gca gga cgt tgt cct cgt cgc gac egg eta gca gtc ggc ggg cca acg cct 
val trp arg ala gly arg cys pro arg arg asp arg leu ala val gly gly pro thr pro 
421/141 451/151 

ccc cga ggc cac ggc ctt gag cgc ggg gaa gga tgt gca att cag tea act cga agt age 
pro arg gly his gly leu glu arg gly glu gly cys ala ile gin ser thr arg ser ser 
481/161 511/171 

tgg tea tea gtc ggg cga teg eta ggc gcg gaa age cgc tgc gtt gca age cca gta cca 
trp ser ser val gly arg ser leu gly ala glu ser arg cys val ala ser pro val pro 
541/181 571/191 

cct get gtt gee acc act ggc egg gcg ccc egg gat age cgt acg cca etc cga gca ttg 
pro ala val ala thr thr gly arg ala pro arg asp ser arg thr pro leu arg ala leu 
601/201 631/211 

gcg cgt tgc tea gtt egg egg ccg acg gca gcg ccg tgg tgt egg egg cct egg cct gtt 
ala arg cys ser val arg arg pro thr ala ala pro trp cys arg arg pro arg pro val 
661/221 691/231 

egg ctg ccg tta cct cga egg ccg cga ccg cct gee age cgc gee gee gga tgt get cca 
arg leu pro leu pro arg arg pro arg pro pro ala ser arg ala ala gly cys ala pro 
721/241 751/251 

gee aca ttg ggg cgc gca aag tct egg tgc ccc tgg ggt age gca teg cgt cga cat aca 
ala thr leu gly arg ala lys ser arg cys pro trp gly ser ala ser arg arg his thr 
781/261 811/271 

ccg tea ggg cat cac cga ggc ggc get cca tat cgc tgg gcg gca gat cga tga gga ata 
pro ser gly his his arg gly gly ala pro tyr arg trp ala ala asp arg OPA gly ile 
841/281 871/291 

teg cca acg cgc ggt gtc etc etc atg tga tga acc gat gcg tgc ttg cgc acc agt ate 
ser pro thr arg gly val leu leu met OPA OPA thr asp ala cys leu arg. thr ser lie 
901/301 . 931/311 „ + • 

gga caa gee gat gag gee gee cgc get gga egg ggc ttg tag cgt atg gee gtt tec get 
qly gin ala asp glu ala ala arg ala gly arg gly leu AMB arg met ala val ser ala 
961/321 991/331 

cag etc gtc get gcg gcg ccg ccg gga tag aat cgc ccg cga acc agt ggt acg gcg cag 
gin leu val ala ala ala pro pro gly AMB asn arg pro arg thr ser gly thr ala gin 



SEQ ID No.45ZB 
FIGURE 45ZB 



I 



150/185 



1021/341 

att gac etc gta tea 
ile asp leu val ser 

1081/361 
tta cgt gac agt ctg 
leu arg asp ser leu 
1141/381 

tgc ttg acc aga teg 
cys leu thr arg ser 
1201/401 

gtg gcg ggg get tec 
val ala gly ala ser 
1261/421 

tea teg gtc tgg gga 
ser ser val trp gly 
1321/441 

tec cga tac tea gcg 
ser arg tyr ser ala 
1381/461 

ccg gtc etc ggt tgt 
pro val leu gly cys 
1441/481 

age gtc gta cca agg 
ser val val pro arg 



tct gag tta gtt gee 
ser glu leu val ala 

teg gca agg agg gac 
ser ala arg arg asp 

aga gcg etc tct acg 
arg ala leu ser thr 

gcg cac cga ccg cgc 
ala his arg pro arg 

tgt tgg ttt ccg gcg 
cys trp phe pro ala 

ttt teg gtt ttg teg 
phe ser val leu ser 

ccg gca gga tgg ate 
pro ala gly trp ile 

ggg ccg ggg get cat 
gly pro gly ala his 



1051/351 

cgc gca atg ggc ate 
arg ala met gly ile 

1111/371 
gca tgc cac tct ccg 
ala cys his ser pro 
1171/391 

ccg aag ate cca agt 
pro lys ile pro ser 
1231/411 

ggc ggc gee tgc agg 
gly gly ala cys arg 
1291/431 

tgg cgt tea aag aga 
trp arg ser lys arg 
1351/451 

tga tgt teg gtg gtg 
OPA cys ser val val 
1411/471 

gtg gcg gat egg ctg 
val ala asp arg leu 
1471/491 

tea cca gee gta tgg 
ser pro ala val trp 



cgc gtg tta teg gta 
arg val leu ser val 

ate atg age age gga 
ile met ser ser gly 

teg cat cga gtg tec 
ser his arg val ser 

gcg egg cgt tgt tea 
ala arg arg cys ser 

cca tga teg gaa gtt 
pro OPA ser glu val 

tgg tgt atg cca tea 
trp cys met pro ser 

ctg ggg ctt cgc gee 
leu gly leu arg ala 

aag ate 
lys ile 



SEQ ID No.45ZB (continued) 
FIGURE 45ZB (continued) 



C"T /"OUT IT ">#Ci 




151/185 

fragment seq4 5ZA shifted minus 2 for the reading frame 

2 2 2 2 2 2 £ 2 S 2 3S 2 3 £ !S 2 S 2 £ 



61/21 _ „ ca aca tcq'cga tag cca gca tgc cgc gcc gtc ggg 

2 ss 2 2 2 2 2 2 2 2 22 .5 p» - - .« - - 

121/41 151/bl 
g cca etc gcg ate ctt ggg atg acg 
.. i => i =, iip leu civ met thr 

211/71 



£ £ 2 £ 2 £ £ 55 2 2 22 2 = 2 2 S 2 2 2 
SS 2 2 2 2 2 2 2 2 §2 - 2 2 £ 2 2 2 2 
S 2 2 2 2 2 2 2 2 2 22 2 2 2 2 2 2 2 2 

301/101 att tc ctc acc gtt ggt etc egg 

2 2 2 2 2 2 2 2 2 p» 2p« ^ 
22 2 2 2 2 2 £ 2 2 22 3 2 2 S 2 2 2 £ 
22 2 3 2 £ 2 2 2 £ 22 2 2 2 2 £ 2 2 2 
2 2 2 2 2 2 2 2 2 2 £2 2 2 £ 2 2 2 2 2 
£ £ £ 2 2 £ 2 2 2 2 2/2 2 2 2 2 2 2 2 2 
SS 2 2 2 2 2 2 2 2 2 2 2 2 2 2 £ 2 £ 2 



S3 2 2 £ 2 2 2 2 2 £2 2 2 2 2 2 2 £ 2 
2 2 2 2 2 2 2 £ 2 2 2 2 2 2 2 2 2 2 £ 2 



811/271 ^ _ 

2 2 2 £ 2 2 2 2 £ 2 2 2 2 2 2 £ 2 2 2 2 
2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 
2 S 2 2 3 2 2 2 £ 2 2 2 2 2 2 2 2 2 2 £ 



SEQ ID NO-45ZC 
FIGURE 45ZC 





152/185 

ill 2 2 3 3 S S =S S £ K 5 5= £ » VI 5 S £ 

5eL 1051/351 

S - s s s a a r~ sgs s = s ss ss ss s 
-as s s s s: s ss a 22 s £ = s = a s s s 
sa's s s ss s s s: s |=j~ - - - - - - - 

33S S SZ 5 S K 5 S = = - = S 5 !S S S Si 

1261/421 1291/431 



cat egg tct ggg gat gtt ggt ttc egg cgt ggc gtt caa aga gac cat gat egg aag ttt 
Ts III sir ITy asp 5.1 gly phe arg arg ^jf^ 1 " «^ asp h» asp arg lys phe 

lllKlV'act. caq cat ttt egg ttt tgt cgt gat gtt egg tgg tgt ggt gta tgc cat cac 
pro Tsl Tr gtn IT, phe a£ phe eys arg «P Jjl«* trp eys gly val cys h» hx. 

ca^tcc'tcg gtt gte egg eag gat gga teg tgg egg ate ggc tgc tgg ggc ttc geg cea 
III ser ser III val arg gin asp gly ser trpjrg^ile gly eys trp gly phe ala pro 

gc^te^tae caa ggg gge egg ggg etc att cac cag ecg tat gga aga tc 
111 ser tyr gin gly gly arg gly leu ile his gin pro tyr gly arg 

SEQ ID No.45ZC (continued 1) 
FIGURE 45ZC (continued 1) 



ORF de seq 45ZA directement 
cag tct gtc ggc aag gag gga 
gin ser val gly lys glu gly 
1141/381 

atg ctt gac cag ate gag age 
met leu asp gin ile glu ser 
1201/401 

cgt ggc ggg ggc ttc cgc gca 
arg gly gly gly P he ar 9 ala 
1261/421 

ate ate ggt ctg ggg atg ttg 
ile ile gly leu gly met leu 
1321/441 

ttc ecg ata etc age gtt ttc 
phe pro ile leu ser val phe 
1381/461 

ace ggt cct egg ttg tec ggc 
thr gly pro arg leu ser gly 
1441/481 

cag cgt cgt ace aag ggg gec 
gin arg arg thr lys gly ala 



en fusion avec phoA 
cgc atg cca etc tec gat 
arg met pro leu ser asp 

1171/391 
get etc tac gec gaa gat 
ala leu tyr ala glu asp 

1231/411 
ecg acc gcg egg egg cgc 
pro thr ala arg arg arg 

1291/431 
gtt tec ggc gtg gcg ttc 
val ser gly val ala phe 

1351/451 
ggt ttt gtc gtg atg ttc 
gly phe val val met phe 

1411/471 
agg atg gat cgt ggc gga 
arg met asp arg gly gly 

1471/491 
ggg ggc tea ttc acc age 

gly gly ser P he tnr ser 
SEQ ID No.45A 



cat gag cag egg 
his glu gin arg 

ccc aag ttc gca teg agt gtc 
pro lys phe ala ser ser val 

ctg cag ggc gcg gcg ttg ttc 
leu gin gly ala ala leu phe 

aaa gag acc atg ate gga agt 
lys glu thr met ile gly ser 

ggt ggt gtg gtg tat gec ate 

gly gly val val tvr aia ile 

teg get get ggg get teg cgc 
ser ala ala gly ala ser arg 

cgt atg gaa gat c 
arg met glu asp 



FIGURE 45A 



# 



153/185 



Sequence Rv2169c predicted by Cole et al . , 1998 (Nature 393:537-544) and 

containing Seq45A 



i/i 31/11 



ata cca etc tec gat cat gag cag egg atg ctt gac cag ate gag age get etc tac gee 
nil pro Su ser lip his glu gin arg met leu^sp gin ile glu ser ala leu tyr ala 

gafgat ecc aag tte gea teg agt gte egt gge ggg gge tte cgc gea eeg ace gcg egg 
glu asp pro lys phe ala ser ser val arg 9^ gl Y P he arg ala pr ° thr ala * rg 

cag' cgc etg cag gge geg gcg ttg tte ate ate ggt etg ggg atg ttg gtt tee gge gtg 
arg arg leu gin gly ala ala leu phe ile i^gly leu met leU Val Ser gly Val 

aeg'tte aaa gag aec atg ate gga agt tte eeg ata etc age gtt tte ggt ttt gte gtg 
III phe lys glu thr met ile gly ser phe pro ile leu ser val phe gly phe val val 

r 271/91 
Itl'llc ggt ggt gtg gtg tat gee ate acc ggt cct egg ttg tec gge agg atg gat egt 
me? Se gly gly vai vai tyr ala ile thr gly pro arg leu ser gly arg met asp arg 



301/101 331/111 
qqc gga teg get get ggg get teg cgc cag egt egt acc aag ggg gee ggg gge tea tte 
lly gly ser ala ala gly ala ser arg gin arg arg thr lys gly ala gly gly ser phe 

361/121 391/131 

acc age egt atg gaa gat egg tte egg cgc cgc tte gac gag taa 

thr ser arg met glu asp arg phe arg arg arg phe asp glu OCH 



SEQ ID No.45D 
FIGURE 45D 



ORF according to Cole et al., 1998 (Nature 393:537-544) and containing Rv2 169c 



1/1 31/11 



taa caa tct gte gge aag gag gga cgc atg cca etc tec gat cat gag cag egg atg ctt 
oil gin ser val gxy lys glu gly arg met pro leu ser asp his glu gin arg met leu 

91/31 

qa^caq ate gag age get etc tac gee gaa gat ecc aag tte gea teg agt gte egt gge 
asp gin til glu ser ala leu tyr ala glu asp^pro lys phe ala ser ser val arg gly 

ggg'ggc tte cgc gea eeg ace gcg egg egg cgc etg cag gge gcg geg ttg tte ate ate 
gly gly phe arg ala pro thr ala arg arg jrg^leu gin gly ala ala leu phe He He 

qg^ctg ggg atg ttg gtt tec gge gtg gcg tte aaa gag ace atg ate gga agt tte eeg 
gly leu gly me? leu val ser gly val ala phe^lys glu thr met He gly ser phe pro 

aJa'ctc age gtt tte ggt ttt gte gtg atg tte ggt ggt gtg gtg tat gee ate ace ggt 
til tin sir val phe gly phe val val met phe^gly gly val val tyr ala xle thr gly 

ect'eqq ttg tec gge agg atg gat egt gge gga teg get get ggg get teg cgc cag egt 
pro arg til ser lly arg me? asp arg gly gly ser ala ala gly ala ser arg gin arg 
^fi1/l?l 391/131 

eat ace aag ggg gee ggg gge tea tte aec age egt atg gaa gat egg tte egg cgc cgc 
tig ?nr Ty l g l y lla gly gly ser phe thr ser arg met glu asp arg phe arg arg arg 
421/141 



tte gac gag taa 
phe asp glu OCH 



SEQ ID No.45F 



FIGURE 45F 




154/185 



1/1 

cag ccg cgc cgc ate gac cag 
gin pro arg arg ile asp gin 
61/21 

tec ggt gaa cat cgc acc agg 
ser gly glu his arg thr arg 
121/41 

ccg gec aac tea cag aca ccc 
pro ala asn ser gin thr pro 
181/61 

cgt cgc ate gee gtc etc gee 
arg arg ile ala val leu ala 
241/81 

teg ggc teg aag cca age ggc 
ser gly ser lys pro ser gly 
301/101 

acc gcg cag acc aag get etc 
thr ala gin thr lys ala leu 



31/11 

ggc etc acg ccc ggt cac 
gly leu thr pro gly his 
91/31 

tta ggc age aat ccc gcg 
leu gly ser asn pro ala 

151/51 
tct acg atg cag ggt atg 
ser thr met gin gly met 

211/71 
gec gtt age ate gec gec 
ala val ser ile ala ala 

271/91 
gga cca ctt ccg gac gcg 
gly pro leu pro asp ala 

331/111 
aag age gcg cac atg gtg 
lys ser ala his met val 



SEQ ID No.46A 
FIGURE 46A 



ttc tec gcg ttc etc aac aat 
phe ser ala phe leu asn asn 

gac ccg cac ccc act cgc cga 
asp pro his pro thr arg arg 

egg acc ccc aga cgc cac tgc 
arg thr pro arg arg his cys 

act gtc gtt gec ggc tgc teg 
thr val val ala gly cys ser 

aag ccg ctg gtc gag gag gee 
lys pro leu val glu glu ala 

ctg acg gtc aac ggc aag ate 
leu thr val asn gly lys ile 



1/1 

age cgc gee gca 
ser arg ala ala 
61/21 

ccg gtg aac ate 
pro val asn ile 
121/41 

egg cca act cac 
arg pro thr his 
181/61 

gtc gca teg ccg 
val ala ser pro 
241/81 

egg get cga age 
arg ala arg ser 
301/101 

ccg cgc aga cca 
pro arg arg pro 



teg acc agg gec 
ser thr arg ala 

gca cca ggt tag 
ala pro gly AMB 

aga cac cct eta 
arg his pro leu 

tec teg ccg ccg 
ser ser pro pro 

caa gcg gcg gac 
gin ala ala asp 

agg etc tea aga 
arg leu ser arg 



31/11 
tea cgc ccg gtc 
ser arg pro val 

91/31 
gca gca ate ccg 
ala ala ile pro 

151/51 
cga tgc agg gta 
arg cys arg val 

211/71 
tta gca teg ccg 
leu ala ser pro 

271/91 
cac ttc egg acg 
his phe arg thr 
331/111 
gcg cgc aca tgg 
ala arg thr trp 



act tct ccg cgt 
thr ser pro arg 

egg acc cgc acc 
arg thr arg thr 

tgc gga ccc cca 
cys gly pro pro 

cca ctg teg ttg 
pro leu ser leu 

cga age cgc tgg 
arg ser arg trp 

tgc tga egg tea 
cys OPA arg ser 



tec tea aca att 
ser ser thr ile 

cca etc gec gac 
pro leu ala asp 

gac gec act gec 
asp ala thr ala 

ccg get get cgt 
pro ala ala arg 

teg agg agg cca 
ser arg arg pro 

acg gca aga tc 
thr ala arg 



SEQ ID No.4 6B 
FIGURE 46B 




155/185 



1/1 

gcc gcg ccg cat 
ala ala pro his 
61/21 

egg tga aca teg 
arg OPA thr ser 
121/41 

ggc caa etc aca 
gly gin leu thr 
181/61 

teg cat cgc cgt 
ser his arg arg 
241/81 

ggg etc gaa gcc 
gly leu glu ala 
301/101 

cgc gca gac caa 
arg ala asp gin 



cga cca ggg cct 
arg pro gly pro 

cac cag gtt agg 
his gin val arg 

gac acc etc tac 
asp thr leu tyr 

cct cgc cgc cgt 
pro arg arg arg 

aag egg egg acc 
lys arg arg thr 

ggc tct caa gag 
gly ser gin glu 



31/11 
cac gcc egg tea 
his ala arg ser 

91/31 
cag caa tec cgc 
gin gin ser arg 

151/51 
gat gca ggg tat 
asp ala gly tyr 

211/71 
tag cat cgc cgc 
AMB his arg arg 

271/91 
act tec gga cgc 
thr ser gly arg 

331/111 
cgc gca cat ggt 
arg ala his gly 



ctt etc cgc gtt 
leu leu arg val 

gga ccc gca ccc 
gly pro ala pro 

gcg gac ccc cag 
ala asp pro gin 

cac tgt cgt tgc 
his cys arg cys 

gaa gcc get ggt 
glu ala ala gly 

get gac ggt caa 
ala asp gly gin 



cct caa caa ttc 
pro gin gin phe 

cac teg ccg acc 
his ser pro thr 

acg cca ctg ccg 
thr pro leu pro 

egg ctg etc gtc 
arg leu leu val 

cga gga ggc cac 
arg gly gly his 

egg caa gat c 
arg gin asp 



SEQ ID No,4 6C 
FIGURE 46C 



156/185 



Coding sequence Rvl411c predicted by Cole et al . , 1998 (Nature 393: 537-544, and 
containing seq4 6A: 

a 3 a a S 5 a 5 S S S K a £ K K S a S K 
S £ a S K E S 2 2 S I Jr - - K - s - - 

as a a a a a a a n s = - s - - : " E = s 



a a a a a a a a a ss s « = - = a = ss a a 



val xcu — - - 271/91 

241/81 



a a a a a a a a a a S3 a a a a a a a a 
a a a a a as: a a a a| a a a a a a a s 

361/121 391/131 



™1 ttc eat eee gee gee g.c ate tae gac ecc gee eag gtg etg aat eeg gat .« 

a a a ss a a ^ ai * * in vai ieu " n pro asp thr 
aa a a a a s a a a a a a a a a a a a a 
a a a a a a a a a a If a a a a a a a a 
a a a a a a a a a a a a a a s a a a a a 
a a a a a a a a a a aa a a a a a a a a 

/991 691/231 
gtc aaa tgg ggc gag aag gtc cag gtc acg aag ccc ccg gtg age tga 
ill sir lys trp giy glu lys v.l gin val thr lys pro pro val ser OPA 



SEQ ID No.46D 
FIGURE 46D 



157/185 

ORF according to Cole et al., 1998 (Nature 393: 537-544): 
and containing the coding sequence Rvl411c: 

1/1 31/11 

taa etc acc cag gtt gga ccg gtt cag tgt etc ggc cat cac gtc ggc ggt gaa ttg gec 
Ten thr gin 5.1 giy pro val gin cys leu^ly his his val gly gly glu leu ala 

gtfggg caa tac ate gac gac egt cag aca cac gec gtt gac age gat cga gtc gee gtg 
val gly gin tyr ile asp asp arg gin thr Ji« ala val asp ser asp arg val ala val 

ic^ggc gtc ggc ggt aac cat egg aec gcg gat ggt cag ccg cgc cgc ate gac cag ggc 
ala gly val gly gly asn his arg thr ala asp 9ly gin pro arg arg He asp gin gly 

ctc'fcq ccc ggt cac ttc tec gcg ttc etc aac aat tec ggt gaa cat cgc acc agg tea 
leu thr pro gly his phe ser ala phe leu asn^asn ser gly glu his arg thr arg leu 

aac'agc aat ccc gcg gac ccg cac ccc act cgc cga ccg gec aac tea cag aca cec tct 
lly ser "n pro ala asp pro his pro thr arg arg pro ala asn ser gin thr pro ser 

aetata cag ggt atg egg acc ccc aga cgc cac tgc cgt cgc ate gec gtc etc gec gec 
Jhr 2 gll gly me? arg thr pro arg arg his^eys arg arg ile ala val leu ala ala 

att'aqc ate gec gec act gtc gtt gec ggc tgc teg teg ggc teg aag cca age ggc gga 
111 ser ill lla ala thr val val ala gly cys ser ser gly ser lys pro ser gly gly 

cca'ct^ ccg gac gcg aag ccg etg gtc gag gag gee aec gcg cag ace aag get etc aag 
pro feu pro Lp all lys pro leu val glu glu^ala thr ala gin thr lys ala leu lys 

aqc'gcg cac atg gtg etg aeg gtc aac ggc aag ate ccg gga etg tct etg aag acg etg 
sir Til his me? vai leu thr val asn gly lys ile pro gly leu ser leu lys thr leu 

llYlll aat etc acc acc aac ccc acc gee gcg acg gga aac gtc aag etc acg etg ggt 
sir gly asp leu thr thr asn pro thr ala ala thr gly asn val lys leu thr leu gly 
601/201 631/211 

tct gat ate gat gec gac ttc gtg gtg ttc gac ggg ate etg tac gec acc etg acg 
lly ser Lp ile asp ala asp phe val val P*je asp gly He leu tyr ala thr leu thr 



ccc aac cag tgg age gat ttc ggt ccc gee gee gac ate tac gac ccc gec cag gtg etg 
pro asn gll trp sir Lp phe gly pro ala ala^asp ile tyr asp pro ala gin val leu 

la^cca aat acc ggc etg gec aac gtg etg gcg aat ttc gee gac gca aaa gec gaa ggg 
..» pro 1st thr gfy leu ala asn val leu jl.«n phe ala asp ala lys ala glu gly 

III' III ace ate aac ggc cag aac acc ate cgc ate age ggg aag gta teg gca cag gcg 
a?g asp thr ile asn g!y gll asn thr ile arg ile ser gly lys val ser ala gin ala 
841/281 871/291 



gig III cag ata gcg ccg ccg ttc aac gcg acg cag ccg gtg ccg gcg ace gtc tgg att 
va! asn gin ile ala pro pro phe asn ala thr gin pro val pro ala thr val trp He 

£'£ 2 K 2 2 2 a 2 2 2 2 £ 2 S 2 2 2 2 2 

£'£ a sr. a 2 2 s 2 2 S £ 2 e 2 2 2 2 2 2 



1021/341 
tga 
OPA 



SEQ ID No.46F 



FIGURE 46F 




158/185 



1/1 

gag ctg gtc aac ggc gcc ggc 
glu leu val asn gly ala gly 
61/21 

ctg gcc gat gcc cag cag atg 
leu ala asp ala gin gin met 
121/41 

gtg ttg gtg gcc teg ggc age 
val leu val ala ser gly ser 
181/61 

ttc gac get gtg atg gac gcg 
phe asp ala val met asp ala 
241/81 

egg gtg ctg etc gag cag ggt 
arg val leu leu glu gin gly 
301/101 

ggg ttg ggc aat gcc gcc ggt 
gly leu gly asn ala ala gly 



31/11 

ate gac gac gcc gcc gtc 
ile asp asp ala ala val 
91/31 

gtc gag gcg gca ctg ggc 
val glu ala ala leu gly 
151/51 

aac cat gtg gcg ccc att 
asn his val ala pro ile 

211/71 
aac gtg egg ggt gcc tgg 
asn val arg gly ala trp 

271/91 

cag ggc ggc age gtg gtg 
gin gly gly ser val val 

331/111 
tac age gcg tac tgc ccg 
tyr ser ala tyr cys pro 



SEQ ID No.47A 
FIGURE 47A 



gtg acc tgc egg ccg gac age 
val thr cys arg pro asp ser 

cga tat ggc cgt ttg gac gga 
arg tyr gly arg leu asp gly 

acc gag atg gcc gtc gag gac 
thr glu met ala val glu asp 

ctg gtg tgt egg gcg gcc gga 
leu val cys arg ala ala gly 

ctg gtg teg tec gtt cgc ggc 
leu val ser ser val arg gly 

teg aag gcg ggc acc gat c 
ser lys ala gly thr asp 



1/1 

age tgg tea acg 
ser trp ser thr 
61/21 

tgg ccg atg ccc 
trp pro met pro 
121/41 

tgt tgg tgg cct 
cys trp trp pro 
181/61 

teg acg ctg tga 
ser thr leu OPA 
241/81 

ggg tgc tgc teg 
gly cys cys ser 
301/101 

ggt tgg gca atg 
gly trp ala met 



gcg ccg gca teg 
ala pro ala ser 

age aga tgg teg 
ser arg trp ser 

egg gca gca acc 
arg ala ala thr 

tgg acg cga acg 
trp thr arg thr 

age agg gtc agg 
ser arg val arg 

ccg ccg gtt aca 
pro pro val thr 



31/11 
acg acg ccg ccg 
thr thr pro pro 

91/31 
agg egg cac tgg 
arg arg his trp 

151/51 
atg tgg cgc cca 
met trp arg pro 

211/71 
tgc ggg gtg cct 
cys gly val pro 

271/91 
gcg gca gcg tgg 
ala ala ala trp 
331/111 
gcg cgt act gcc 
ala arg thr ala 



teg tga cct gcc 
ser OPA pro ala 

gcc gat atg gcc 
ala asp met ala 

tta ccg aga tgg 
leu pro arg trp 

ggc tgg tgt gtc 
gly trp cys val 

tgc tgg tgt cgt 
cys trp cys arg 

cgt cga agg egg 
arg arg arg arg 



ggc egg aca gcc 
gly arg thr ala 

gtt tgg acg gag 
val trp thr glu 

ccg teg agg act 
pro ser arg thr 

ggg egg ccg gac 
gly arg pro asp 

ccg ttc gcg gcg 
pro phe ala ala 

gca ccg ate 
ala pro ile 



SEQ ID No.47B 
FIGURE 47B 
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1/1 

get ggt caa egg cgc 
ala gly gin arg arg 
61/21 

ggc cga tgc cca gca 
gly arg cys pro ala 
121/41 

gtt ggt ggc etc ggg 
val gly gly i eu ^Y 

181/61 

cga cgc tgt gat gga 
arg arg cys asp gly 
241/81 

ggt get get cga gca 
gly ala ala arg ala 
301/101 

gtt ggg caa tgc cgc 
val gly gin cys arg 



egg cat cga cga cgc 
arg his arg arg arg 

gat ggt cga ggc ggc 
asp gly arg gly gly 

cag caa cca tgt ggc 
gin gin pro cys gly 

cgc gaa cgt gcg ggg 
arg glu arg ala gly 

ggg tea ggg egg cag 
gly ser gly arg gin 

egg tta cag cgc gta 
arg leu gin arg val 



31/11 

cgc cgt cgt gac ctg 
arg arg arg asp leu 
91/31 

act ggg ccg ata tgg 
thr gly pro ile trp 
151/51 

gee cat tac cga gat 
ala his tyr arg asp 
211/71 

tgc ctg get ggt gtg 
cys leu ala gly val 
271/91 

cgt ggt get ggt gtc 
arg gly ala gly val 
331/111 

ctg ccc gtc gaa ggc 
leu pro val glu gly 



ccg gee gga cag cct 
pro ala gly gin pro 

ccg ttt gga egg agt 
pro phe gly arg ser 

ggc cgt cga gga ctt 
gly arg arg gly leu 

teg ggc ggc egg acg 
ser gly gly arg thr 

gtc cgt teg egg egg 
val arg ser arg arg 

ggg cac cga tc 
gly his arg 



SEQ ID No.47C 
FIGURE 47C 



160/185 



Coding sequence Rvl714 predicted by Cole et al . , 1998 (Nature 393: 537-544) and 
containing seq 47A: 



1/1 

gtg gag gaa atg gcg ctg get 
val glu glu met ala leu ala 
61/21 

cag gac aag teg ate ctg ate 
gin asp lys ser ile leu ile 
121/41 

gcg ctg gee gac gcg gga gcg 
ala leu ala asp ala gly ala 
181/61 

gag ctg gtc aac ggc gec ggc 
glu leu val asn gly ala gly 
241/81 

ctg gee gat gec cag cag atg 
leu ala asp ala gin gin met 
301/101 

gtg ttg gtg gec teg ggc age 
val leu val ala ser gly ser 
361/121 

ttc gac get gtg atg gac gcg 
phe asp ala val met asp ala 
421/141 

egg gtg ctg etc gag cag ggt 
arg val leu leu glu gin gly 
481/161 

ggg ttg ggc aat gee gee ggt 
gly leu gly asn ala ala gly 
541/181 

ttg gee aag aca ttg gcg gee 
leu ala lys thr leu ala ala 
601/201 

ccg acg gtg ttt egg tec gcg 
pro thr val phe arg ser ala 
661/221 

gec acc egg gag gcg atg etc 
ala thr arg glu ala met leu 
721/241 

ttc gtc ggc gee ctg ate tat 
phe val gly ala leu ile tyr 
781/261 

atg tat ctg gac ggc ggg tac 
met tyr leu asp gly gly tyr 



31/11 

cag cag gtg ccg aac ctg 
gin gin val pro asn leu 
91/31 

acc ggc gcg acc ggt teg 
thr gly ala thr gly ser 
151/51 

egg ctg aca ctg gec ggc 
arg leu thr leu ala gly 

211/71 
ate gac gac gee gee gtc 
ile asp asp ala ala val 

271/91 
gtc gag gcg gca ctg ggc 
val glu ala ala leu gly 

331/111 
aac cat gtg gcg ccc att 
asn his val ala pro ile 

391/131 
aac gtg egg ggt gec tgg 
asn val arg gly ala trp 

451/151 
cag ggc ggc age gtg gtg 
gin gly gly ser val val 

511/171 
tac age gcg tac tgc ccg 
tyr ser ala tyr cys pro 

571/191 
gaa tgg ggc ggt cac ggc 
glu trp gly gly his gly 

631/211 
gtg acc gag tgg atg ttc 
val thr glu trp met phe 

691/231 
gec egg ate ccg ttg cgc 
ala arg ile pro leu arg 

751/251 
ctg etc age gac gee teg 
leu leu ser asp ala ser 

811/271 
acc gca tgc tga 
thr ala cys OPA 



SEQ ID No.47D 
FIGURE 47D 



ggt ctg gcg cgc ttc age gtg 
gly leu ala arg phe ser val 

ttg ggc cga gtt gec gec egg 
leu gly arg val ala ala arg 

ggc aac teg gec ggt ctg gee 
gly asn ser ala gly leu ala 

gtg acc tgc egg ccg gac age 
val thr cys arg pro asp ser 

cga tat ggc cgt ttg gac gga 
arg tyr gly arg leu asp gly 

acc gag atg gee gtc gag gac 
thr glu met ala val glu asp 

ctg gtg tgt egg gcg gee gga 
leu val cys arg ala ala gly 

ctg gtg teg tec gtt cgc ggc 
leu val ser ser val arg gly 

teg aag gcg ggc acc gat ctg 
ser lys ala gly thr asp leu 

att egg gtg aac gcg ctg gcg 
ile arg val asn ala leu ala 

acc gac gat ccg aag ggc egg 
thr asp asp pro lys gly arg 

cgc ttc gee gaa ccg gaa gac 
arg phe ala glu pro glu asp 

age ttc tac acc ggc cag gtg 
ser phe tyr thr gly gin val 



161/185 



ORF according to Cole et al., 1998 (Nature 393: 537-544) and containing the 

coding sequence Rvl714: 



24/1 

tag gtg gag gaa atg gcg ctg 
AMB val glu glu met ala leu 
84/21 

gtg cag gac aag teg ate ctg 
val gin asp lys ser ile leu 
144/41 

egg gcg ctg gec gac gcg gga 
arg ala leu ala asp ala gly 
204/61 

gec gag ctg gtc aac ggc gec 
ala glu leu val asn gly ala 
264/81 

age ctg gec gat gec cag cag 
ser leu ala asp ala gin gin 
324/101 

gga gtg ttg gtg gec teg ggc 
gly val leu val ala ser gly 
384/121 

gac ttc gac get gtg atg gac 
asp phe asp ala val met asp 
444/141 

gga egg gtg ctg etc gag cag 
gly arg val leu leu glu gin 
504/161 

ggc ggg ttg ggc aat gec gec 

gly gly leu 9 lv asn ala ala 

564/181 

ctg ttg gec aag aca ttg gcg 
leu leu ala lys thr leu ala 
624/201 

gcg ccg acg gtg ttt egg tec 
ala pro thr val phe arg ser 
684/221 

egg gee acc egg gag gcg atg 
arg ala thr arg glu ala met 
744/241 

gac ttc gtc ggc gec ctg ate 
asp phe val gly ala leu ile 
804/261 

gtg atg tat ctg gac ggc ggg 
val met tyr leu asp gly gly 



54/11 

get cag cag gtg ccg aac 
ala gin gin val pro asn 
114/31 

ate acc ggc gcg acc ggt 
ile thr gly ala thr gly 
174/51 

gcg egg ctg aca ctg gee 
ala arg leu thr leu ala 

234/71 
ggc ate gac gac gee gee 
gly ile asp asp ala ala 

294/91 
atg gtc gag gcg gca ctg 
met val glu ala ala leu 

354/111 
age aac cat gtg gcg ccc 
ser asn his val ala pro 

414/131 
gcg aac gtg egg ggt gee 
ala asn val arg gly ala 

474/151 
ggt cag ggc ggc age gtg 
gly gin gly gly ser val 

534/171 
ggt tac age gcg tac tgc 
gly tyr ser ala tyr cys 

594/191 
gee gaa tgg ggc ggt cac 
ala glu trp gly gly his 

654/211 
gcg gtg acc gag tgg atg 
ala val thr glu trp met 

714/231 
etc gec egg ate ccg ttg 
leu ala arg ile pro leu 

774/251 
tat ctg etc age gac gee 
tyr leu leu ser asp ala 

834/271 
tac acc gca tgc tga 
tyr thr ala cys OPA 



ctg ggt ctg gcg cgc ttc age 
leu gly leu ala arg phe ser 

teg ttg ggc cga gtt gec gec 
ser leu gly arg val ala ala 

ggc ggc aac teg gec ggt ctg 
gly gly asn ser ala gly leu 

gtc gtg acc tgc egg ccg gac 
val val thr cys arg pro asp 

ggc cga tat ggc cgt ttg gac 
gly arg tyr gly arg leu asp 

att acc gag atg gec gtc gag 
ile thr glu met ala val glu 

tgg ctg gtg tgt egg gcg gee 
trp leu val cys arg ala ala 

gtg ctg gtg teg tec gtt cgc 
val leu val ser ser val arg 

ccg teg aag gcg ggc acc gat 
pro ser lys ala gly thr asp 

ggc att egg gtg aac gcg ctg 
gly ile arg val asn ala leu 

ttc acc gac gat ccg aag ggc 
phe thr asp asp pro lys gly 

cgc cgc ttc gee gaa ccg gaa 
arg arg phe ala glu pro glu 

teg age ttc tac acc ggc cag 
ser ser phe tyr thr gly gin 



SEQ ID No.47F 
FIGURE 47F 
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1/1 31/11 

agg etc atg age aag acg gtt etc ate ctt ggc gcg ggt gtc ggc ggc ctg ace acc gee 

ara leu met ser lys thr val leu ile leu gly ala gly val gly gly leu thr thr ala 

61/21 91/31 

gac acc etc cgt caa ctg eta cca cct gag gat c 

asp thr leu arg gin leu leu pro pro glu asp 



SEQ ID No.48A 
FIGURE 48A 



1/1 31/11 



qqc tea tga gca aga egg ttc tea tec ttg gcg egg gtg teg gcg gee tga cca ceg ccg 
aly ser OPA ala arg arg P he ser ser leu ala arg val ser ala ala OPA pro pro pro 

61/21 91/31 
aca ccc tec gtc aac tgc tac cac ctg agg ate 
thr pro ser val asn cys tyr his leu arg ile 



SEQ ID No.48B 
FIGURE 48B 



1/1 31/11 



qct cat gag caa gac ggt tct cat cct tgg cgc ggg tgt egg egg cct gac cac cgc cga 
ala his glu gin asp gly ser his pro trp arg gly cys arg arg pro asp his arg arg 
61/21 

cac cct ccg tea act get acc acc tga gga tc 
his pro pro ser thr ala thr thr OPA gly 



SEQ ID No.48C 
FIGURE 48C 
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coding sequence Rv0331 predicted by Cole et al., 1998 (Nature 393: 537-544) and 
containing seq48A: 

1/1 31/11 

ata aac aag acg gtt etc ate ctt ggc gcg ggt gtc ggc ggc ctg acc acc gec gac acc 
Me? sir lyl thr val leu ile leu gly ala gly val gly gly leu thr thr ala asp thr 
fil /21 91/31 

etc egt caa ctg eta cca cct gag gat cga ate ata ttg gtg gac agg age ttt gac ggg 
Tel trl gin leu leu pro pro glu asp arg ile ile leu val asp arg ser phe asp gly 

191/41 151/51 

acg ctg gge ttg teg ttg eta tgg gtg ttg egg ggc tgg egg egg eet gac gac gtc egc 
thr leu gly leu ser leu leu trp val leu arg gly trp arg arg pro asp asp val arg 

i ft 1 /61 211/71 

all egc ece acc gcg gcg teg ctg ccc ggt gtg gaa atg gtt act gca acc gtc gee cac 
val arg pro thr III ill ser leu pro gly val^lu met val thr ala thr val ala his 

att'gac ate gcg gec cag gta gtg cac acc gac aac age gtc ate ggc tat gac gcg ttg 
111 Isp ile all Ila gin val val his thr asp asn ser val ile gly tyr asp ala leu 

^m/101 331/111 

ata ate gca tta ggt gcg gcg ctg aac acc gac gee gtt ccc gga ctg teg gac gcg etc 
III Tit III leu g!y ala 111 leu asn thr asp ala val pro gly leu ser asp ala leu 
^fil/121 391/131 

aac gec gac gtc gcg ggc cag ttc tac acc ctg gac ggc gcg get gag ctg egt gcg aag 
Isp lla Isp val 111 gly gin phe tyr thr leu asp gly ala ala glu leu arg ala lys 
421/141 451/151 

rtcqaq gcg etc gag cat ggc egg ate get gtg get ate gec ggg gtg ccg ttc aaa tgc 
val gtl HI leu glu his g?y arg ile ala val ala ile ala gly val pro phe lys cys 
dfti/ifil 511/171 

cca gee gca ccg ttc gaa gcg gcg ttt ctg ate gec gee caa etc ggt gac egc tac gec 
pro 111 HI pro phe glu III ala phe leu ile ala ala gin leu gly asp arg tyr ala 

571/ 191 

acc'gga acc gta cag ate gac acg ttc acg eet gac ccg ctg ccg atg ccc gtt gca ggt 
thr gly thr val gin ile asp thr phe thr pro asp pro leu pro met pro val ala gly 

^ J 3 631/211 

III' III gtc ggc gag get ttg gtc teg atg etc aag gat cac ggt gtc ggc ttc cat cct 
pro 111 val gly glu Ila lei val ser met leu lys asp his gly val gly phe his pro 
fifil/221 691/231 

cac aag gec eta get egc gtc gat gag gec gca agg acg atg cac ttc ggt gac gge acg 
arg lyl £a leu Ila arg val asp glu ala ala arg thr met his phe gly asp gly thr 
721/241 751/251 

tec aaa ccg ttc gat ctg ctt gec gtg gtc ccc ccg cac gtg ccc tec gee gcg gcg egg 
ser glu pro £e Lp lei leu Ila val val pro pro his val pro ser ala ala ala arg 

lll'ltl ggt etc age gaa tec ggg tgg ata ccc gtg gac ccg egc acc ctg tec act age 
sir 111 Tv 1- sir glu ser gly trp ile pr^val asp pro arg thr leu ser thr ser 

acc'aac aac gtg tgg gec ate ggc gat gcg acc gtg ctg acg ctg ccg aat ggc aaa ccg 
111 Isp asn S tip Ila ile gly asp ala thr val leu thr leu pro asn gly lys pro 

om /^ni 931/311 

ctg ece aag get gec gtg ttc gee gaa gec cag gee gea gtt gtc gee cac gge gtc gec 
Tel pro lyl Ila Ila val phe ala glu ala gin ala ala val val ala hxs gly val ala 

caveat etc ggt tac gac gta get gag egc cac ttc acc ggc acg ggc gec tgc tac gtc 
alg Ms ill gly tyr lap val Ila glu arg his phe thr gly thr gly ala cys tyr val 
1021/341 1051/351 



gagacc ggt gat cac eag gca gec aag ggc gac ggc gat ttc ttc get ecg teg gag cec 
III thr gly Isp his gin ala ala lys gly *JJ g^* 8 * phe phe ala pr ° S6r * 

tc^at^acq ctg tac ccg ccg teg egg gag ttt cac gag gag aag gtc gca caa gaa ctg 
S3 111 S lei lyr pro pro s.r arg glu phe his glu glu lys val ala gin glu leu 



1141/381 

gec tgg ctg acc egc tgg aag acg tga 
ala trp leu thr arg trp lys thr OPA 



SEQ ID No.48D 
FIGURE 48D 
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ORF according to Cole et al., 1998 (Nature 393: 537-544) and containing coding 
sequence Rv0331: 



1/1 

tga aca ccc gcg ccg 
OPA thr pro ala pro 
61/21 

cca tgg gcc gat aat 
pro trp ala asp asn 
121/41 

gcg gcg gaa tct aat 
ala ala glu ser asn 
181/61 

age aag acg gtt etc 
ser lys thr val leu 
241/81 

cgt caa ctg eta cca 
arg gin leu leu pro 
301/101 

ctg ggc ttg teg ttg 
leu gly leu ser leu 
361/121 

cgc ccc acc gcg gcg 
arg pro thr ala ala 
421/141 

gac ate gcg gcc cag 
asp ile ala ala gin 
481/161 

ate gca tta ggt gcg 
ile ala leu gly ala 
541/181 

gcc gac gtc gcg ggc 
ala asp val ala gly 
601/201 

gag gcg etc gag cat 
glu ala leu glu his 
661/221 

gcc gca ccg ttc gaa 
ala ala pro phe glu 
721/241 

gga acc gta cag ate 
gly thr val gin ile 
781/261 

gag gtc ggc gag get 
glu val gly glu ala 
841/281 

aag gcc eta get cgc 
lys ala leu ala arg 



acg egg cga caa teg 
thr arg arg gin ser 

agt ttg act gac teg 
ser leu thr asp ser 

att cca aag ata tat 
ile pro lys ile tyr 

ate ctt ggc gcg ggt 
ile leu gly ala gly 

cct gag gat cga ate 
pro glu asp arg ile 

eta tgg gtg ttg egg 
leu trp val leu arg 

teg ctg ccc ggt gtg 
ser leu pro gly val 

gta gtg cac acc gac 
val val his thr asp 

gcg ctg aac acc gac 
ala leu asn thr asp 

cag ttc tac acc ctg 
gin phe tyr thr leu 

ggc egg ate get gtg 
gly arg ile ala val 

gcg gcg ttt ctg ate 
ala ala phe leu ile 

gac acg ttc acg cct 
asp thr phe thr pro 

ttg gtc teg atg etc 
leu val ser met leu 

gtc gat gag gcc gca 
val asp glu ala ala 



31/11 

egg aaa acc ggt ccg 
arg lys thr gly pro 
91/31 

gtc agt cac ccc aag 
val ser his pro lys 
151/51 

gga act cga tgc gaa 
gly thr arg cys glu 
211/71 

gtc ggc ggc ctg acc 
val gl* gly leu thr 
271/91 

ata ttg gtg gac agg 
ile leu val asp arg 
331/111 

ggc tgg egg egg cct 
gly trp arg arg pro 
391/131 

gaa atg gtt act gca 
glu met val thr ala 
451/151 

aac age gtc ate ggc 
asn ser val ile gly 
511/171 

gcc gtt ccc gga ctg 
ala val pro gly leu 
571/191 

gac ggc gcg get gag 
asp gly ala ala glu 
631/211 

get ate gcc ggg gtg 
ala ile ala gly val 
691/231 

gcc gcc caa etc ggt 
ala ala gin leu gly 
751/251 

gac ccg ctg ccg atg 
asp pro leu pro met 
811/271 

aag gat cac ggt gtc 
lys asp his gly val 
871/291 

agg acg atg cac ttc 
arg thr met his phe 



egg gaa tgc tgc ggg 
arg glu cys cys gly 

acc ttg cgc aag act 
thr leu arg lys thr 

gga ate agg etc atg 
gly ile arg leu met 

acc gcc gac acc etc 
thr ala asp thr leu 

age ttt gac ggg acg 
ser phe asp gly thr 

gac gac gtc. cgc gtc 
asp asp val arg val 

acc gtc gcc cac att 
thr val ala his ile 

tat gac gcg ttg gtg 
tyr asp ala leu val 

teg gac gcg etc gac 
ser asp ala leu asp 

ctg cgt gcg aag gtc 
leu arg ala lys val 

ccg ttc aaa tgc cca 
pro phe lys cys pro 

gac cgc tac gcc acc 
asp arg tyr ala thr 

ccc gtt gca ggt ccc 
pro val ala gly pro 

ggc ttc cat cct cgc 
gly phe his pro arg 

ggt gac ggc acg tec 
gly asp gly thr ser 



SEQ ID No.48F 
FIGURE 48F 
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901/301 

gaa ccg ttc gat ctg 
glu pro phe asp leu 
961/321 

gcg ggt etc age gaa 
ala gly leu ser.glu 
1021/341 

gac aac gtg tgg gec 
asp asn val trp ala 
1081/361 

ccc aag get gee gtg 
pro lys ala ala val 
1141/381 

cat etc ggt tac gac 
his leu gly tyr asp 
1201/401 

ace ggt gat cac cag 
thr gly asp his gin 
1261/421 

gtg acg ctg tac ccg 
val thr leu tyr pro 
1321/441 

tgg ctg acc cgc tgg 
trp leu thr arg trp 



ctt gec gtg gtc ccc 
leu ala val val pro 

tec ggg tgg ata ccc 
ser gly trp ile pro 

ate ggc gat gcg acc 
ile gly asp ala thr 

ttc gec gaa gee cag 
phe ala glu ala gin 

gta get gag cgc cac 
val ala glu arg his 

gca gec aag ggc gac 
ala ala lys gly asp 

ccg teg egg gag ttt 
pro ser arg glu phe 

aag acg tga 
lys thr OPA 



931/311 

ccg cac gtg ccc tec 
pro his val pro ser 
991/331 

gtg gac ccg cgc acc 
val^ asp pro arg thr 
1051/351 

gtg ctg acg ctg ccg 
val leu thr leu pro 
1111/371 

gee gca gtt gtc gee 
ala ala val val ala 
1171/391 

ttc acc ggc acg ggc 
phe thr gly thr gly 
1231/411 

ggc gat ttc ttc get 
gly asp phe phe ala 
1291/431 

cac gag gag aag gtc 
his glu glu lys val 



gee gcg gcg egg tea 
ala ala ala arg ser 

ctg tec act age gec 
leu ser thr ser ala 

aat ggc aaa ccg ctg 
asn gly lys pro leu 

cac ggc gtc gee cgc 
his gly val ala arg 

gee tgc tac gtc gag 
ala cys tyr val glu 

ccg teg gcg ccc teg 
pro ser ala pro ser 

gca caa gaa ctg gee 
ala gin glu* leu ala 



SEQ ID No.48F (continued) 
FIGURE 48F (continued) 




166/185 



Fragment amplified by PCR 
protease of the E.coli htrA 
of the SnaBI site at the 3' 

1/1 

cca tct aca ccg etc aac age 
pro ser thr pro leu asn ser 
61/21 

tga tec gtg gcg agt tgt tea 
OPA ser val ala ser cys ser 
121/41 

tec gtc tga cca acg gta gtt 
ser val OPA pro thr val val 
181/61 

tea tga aca age tgc gtt ggg 
ser OPA thr ser cys val gly 
241/81 

egg tgg ccg ggg gga tgg tea 
arg trp pro gly gly trp ser 
301/101 

egg ccg age ggg tgg cgc gaa 
arg pro ser gly trp arg glu 
361/121 

aat tgg cca ggc tga cag agg 
asn trp pro gly OPA gin arg 
421/141 

aac ggc agg caa ggc tgg tta 
asn gly arg gin gly trp leu 
481/161 

tgc gca cca atg teg aac tct 
cys ala pro met ser asn ser 
541/181 

age agg aga tgg teg ace tgc 
ser arg arg trp ser thr cys 
601/201 

tgg tag gcg att tgg tgg ace 
trp AMB ala ile trp trp thr 
661/221 

teg aca tgg ctg acg teg teg 
ser thr trp leu thr ser ser 
721/241 

tec ttt teg acg teg agg tga 
ser phe ser thr ser arg OPA 
781/261 

gga tgg cgc tta ace tga tgg 
gly trp arg leu thr OPA trp 
841/281 

gtg tea ggc tga gee age teg 
val ser gly OPA ala ser ser 



based on the sequence 
family (creation of the 
end) and subcloned into 

31/11 

egg gee aga cgc tgc egg 
arg ala arg arg cys arg 
91/31 

tgt cgc ggc gca cca ccg 
cys arg gly ala pro pro 
151/51 

cgc tgc tga tct cca aaa 
arg cys OPA ser pro lys 

211/71 
tgc tat tga teg tgg gtg 
cys tyr OPA ser trp val 

271/91 
ccc ggg ccg ggc tga ggc 
pro gly pro gly OPA gly 

331/111 
ccg acg ace tgc ggc cca 
pro thr thr cys gly pro 

391/131 
cat tea att taa tgc tgc 
his ser ile OCH cys cys 

451/151 
ccg acg ccg gac atg aat 
pro thr pro asp met asn 

511/171 
tga tgg cct cga tgg ccc 
OPA trp pro arg trp pro 

571/191 
gtg ccg atg tgc tgg etc 
val pro met cys trp leu 

631/211 
tgt ccc gag gcg acg ccg 
cys pro glu ala thr pro 

691/231 
ace gca gec tgg age ggg 
thr ala ala trp ser gly 

751/251 
ttg ggt ggc agg ttt atg 
leu gly gly arg phe met 

811/271 
aca acg ccg cga agt gga 
thr thr pro arg ser gly 

871/291 
acg cgt cgc acg ctg age 
thr arg arg thr leu ser 



similarities with a serine 
BamHI site at the 5' end and 
the vector pJVED: 



teg gtg ctg ccg aga agg egg 
ser val leu pro arg arg arg 

ccg ace aac ggg tgc ttg cca 
pro thr asn gly cys leu pro 

gtc tea age cca ccg aag cag 
val ser ser pro pro lys gin 

gga teg ggg tgg egg teg ccg 
gly ser gly trp arg ser pro 

egg tgg gee gee tea ccg aag 
arg trp ala ala ser pro lys 

tec ccg tct teg gca gcg acg 
ser pro ser ser ala ala thr 

ggg cgc tgg ccg agt cac ggg 
gly arg trp pro ser his gly 

tgc gta ccc cgc taa cgt cgc 
cys val pro arg OCH arg arg 

egg ggg etc cgc ggc tac cca 
arg gly leu arg. gly tyr pro 

aaa teg agg aat tgt cca cac 
lys ser arg asn cys pro his 

gag aag tgg tgc acg age egg 
glu lys trp cys thr ser arg 

tea ggc ggc ggc gca acg ata 
ser gly gly gly ala thr ile 

gcg ata ccg ctg gat tgt cgc 
ala ile pro leu asp cys arg 

gee cgc egg gcg gee acg tgg 
ala arg arg ala ala thr trp 

tgg tgg ttt ccg ace gcg gee 
trp trp phe pro thr ala ala 



SEQ ID No.49A 
FIGURE 49A 



167/185 



931/ 311 

egg gca ttc ccg tgc agg age gec gtc tgg tgt ttg aac ggt ttt acc ggt egg cat egg 
arg ala phe pro cys arg ser ala val trp cys leu asn gly phe thr gly arg his arg 
961/321 991/331 

cac ggg cgt tgc egg gtt egg. gec teg ggt tgg cga teg tea aac agg tgg tgc tea acc 
his gly arg cys arg val arg ala ser gly trp arg ser ser asn arg trp cys ser thr 
1021/341 1051/351 

acg gcg gat tgc tgc gca teg aag aca ccg acc cag gcg gec age ccc ctg gaa cgt cga 
thr all asp cys cys ala ser lys thr pro thr gin ala ala ser pro leu glu arg arg 
1081/361 1111/371 

ttt acg tgc tgc tec ccg gee gtc gga tgc cga ttc cgc age ttc ccg gtg cga egg ctg 
ohe thr cys cys ser pro ala val gly cys arg phe arg ser phe pro val arg arg leu 

1171/391 

gc^ctc gga gca egg aca teg aga act etc ggg gtt egg cga acg tta tct cag tgg aat 
111 leu gly ala arg thr ser arg thr leu gly val arg arg thr leu ser gin trp asn 
1201/401 1231/411 

etc agt cca cgc gcg caa cct agt tgt gca gtt act gtt gaa age cac acc cat gec agt 
leu ser pro arg ala gin pro ser cys ala val thr val glu ser his thr his ala ser 
1261/421 1291/431 

cca cgc atg gee aag ttg gee cga gta gtg ggc eta gta cag gaa gag caa cct age gac 
pro arg met ala lys leu ala arg val val gly leu val gin glu glu gin pro ser asp 
1321/441 1351/451 

atg acg aat cac cca egg tat teg cca ccg ccg cag cag ccg gga acc cca ggt tat get 
met thr asn his pro arg tyr ser pro pro pro gin gin pro gly thr pro gly tyr ala 
1381/461 1411/471 

cag ggg cag cag caa acg tac age cag cag ttc gac tgg cgt tac cca ccg tec ccg ccc 
gin glv gin gin gin thr tyr ser gin gin phe asp trp arg tyr pro pro ser pro pro 
1441/481 1471/491 

ccg cag cca acc cag tac cgt caa ccc tac gag gcg ttg ggt ggt acc egg ccg ggt ctg 
pro gin pro thr gin tyr arg gin pro tyr glu ala leu gly gly thr arg pro gly leu 

1501/501 1531/511 

ata cct ggc gtg att ccg acc atg acg ccc cct cct ggg atg gtt cgc caa cgc cct cgt 
ile pro gly val ile pro thr met thr pro pro pro gly met val arg gin arg pro arg 
1561/521 1591/531 

gca ggc atg ttg gee ate ggc gcg gtg acg ata gcg gtg gtg tee gec ggc ate ggc ggc 
ala gly met leu ala ile gly ala val thr ile ala val val ser ala gly He gly gly 
1621/541 1651/551 

gcg gec gca tec ctg gtc ggg ttc aac egg gca ccc gec ggc ccc age ggc ggc cca gtg 
Til ala ala ser leu val gly phe asn arg j£ P*^* 1 * gly pr ° S6r gl * gly ?r ° 

qc^gc^agc gcg gcg cca age ate ccc gca gca aac atg ccg ccg ggg teg gtc gaa cag 
III Tit sir 111 111 pro ser ile pro ala ala asn met pro pro gly ser val glu gin 
1741/581 1771/591 

atg gcg gec aag gtg gtg ccc agt gtc gtc atg ttg gaa acc gat ctg ggc cgc cag teg 
S lla Tit lys lal va! pro ser val val met leu glu thr asp leu gly arg gin ser 

1801/601 1831/611 

gag gag gge tec ggc ate att ctg tct gee gag ggg ctg ate ttg ace aac aac cac gtg 
all gll gly ser gly ile ile leu ser ala glu gly leu ile leu thr asn asn hxs val 
1861/621 1891/631 

ate gcg gcg gee gee aag cct ccc ctg ggc agt ccg ccg ccg aaa acg acg gta 
ile ala ala ala ala lys pro pro leu gly ser pro pro pro lys thr thr val 



SEQ ID No.49A (continued 1) 
FIGURE 49A (continued 1) 



# 



168/185 



1/1 31/11 

cat eta cac cgc tea aca gee ggg cca gac get gec ggt egg tgc tgc cga gaa ggc ggt 
his leu his arg ser thr ala gly pro asp ala ala gly arg cys cys arg glu gly gly 
61/21 91/31 

gat ccg tgg cga gtt gtt cat gtc gcg gcg cac cac cgc cga cca acg ggt get tgc cat 
asp pro trp arg val val his val ala ala his his arg arg pro thr gly ala cys his 
121/41 151/51 

ccg tct gac caa egg tag ttc get get gat etc caa aag tct caa gee cac cga age agt 
pro ser asp gin arg AMB phe ala ala asp leu gin lys ser gin ala his arg ser ser 
181/61 211/71 

cat gaa caa get gcg ttg ggt get att gat cgt ggg tgg gat egg ggt ggc ggt cgc cgc 
his glu gin ala ala leu gly ala ile asp arg gly trp asp arg gly gly gly arg arg 
241/81 271/91 

ggt ggc egg ggg gat ggt cac ccg ggc egg get gag gee ggt ggg ccg cct cac cga age 
Qly gly arg gly asp gly his pro gly arg ala glu ala gly gly pro pro his arg ser 
301/101 331/111 

ggc cga gcg ggt ggc gcg aac cga cga cct gcg gee cat ccc cgt ctt egg cag cga cga 
gly arg ala gly gly ala asn arg arg pro ala ala his pro arg leu arg gin arg arg 
361/121 391/131 

att ggc cag get gac aga ggc att caa ttt aat get gcg ggc get ggc cga gtc acg gga 
ile gly gin ala asp arg gly ile gin phe asn ala ala gly ala gly arg val thr gly 
421/141 451/151 

acg gca ggc aag get ggt tac cga cgc egg aca tga att gcg tac ccc get aac gtc get 
thr ala gly lys ala gly tyr arg arg arg thr OPA ile ala tyr pro ala asn val ala 
481/161 511/171 

gcg cac caa tgt cga act ctt gat ggc etc gat ggc ccc ggg ggc tec gcg get ace caa 
ala his gin cys arg thr leu asp gly leu asp gly pro gly gly ser ala ala thr gin 
541/181 571/191 

gca gga gat ggt cga cct gcg tgc cga tgt get ggc tea aat cga gga att gtc cac act 
ala gly asp gly arg pro ala cys arg cys ala gly ser asn arg gly ile val his thr 
601/201 631/211 

ggt agg cga ttt ggt gga cct gtc ccg agg cga cgc egg aga agt ggt gca cga gee ggt 
gly arg arg phe gly gly pro val pro arg arg arg arg arg ser gly ala arg ala gly 
661/221 691/231 

cga cat ggc tga cgt cgt cga ccg cag cct gga gcg ggt cag gcg gcg gcg caa cga tat 
arg his gly OPA arg arg arg pro gin pro gly ala gly gin ala ala ala gin arg tyr 
721/241 751/251 

cct ttt cga cgt cga ggt gat tgg gtg gca ggt tta tgg cga tac cgc tgg att gtc gcg 
pro phe arg arg arg gly asp trp val ala gly leu trp arg tyr arg trp xle val ala 
781/261 811/271 

gat ggc get taa cct gat gga caa cgc cgc gaa gtg gag ccc gee ggg egg cca cgt ggg 
asp gly ala OCH pro asp gly gin arg arg glu val glu pro ala gly arg pro arg gly 
841/281 871/291 

tgt cag get gag cca get cga cgc gtc gca cgc tga get ggt ggt ttc cga ccg egg ccc 
cys gin ala glu pro ala arg arg val ala arg OPA ala gly gly phe arg pro arg pro 
901/301 931/311 

ggg cat tec cgt gca gga gcg ccg tct ggt gtt tga acg gtt tta ccg gtc ggc ate ggc 
gly his ser arg ala gly ala pro ser gly val OPA thr val leu pro val gly lie gly 
961/321 991/331 

acg ggc gtt gee ggg ttc ggg cct egg gtt ggc gat cgt caa aca ggt ggt get caa cca 
thr gly val ala gly phe gly pro arg val gly asp arg gin thr gly gly ala gin pro 
1021/341 1051/351 

egg egg att get gcg cat cga aga cac cga ccc agg egg cca gee ccc tgg aac gtc gat 
arg arg ile ala ala his arg arg his arg pro arg arg pro ala pro trp asn val asp 



SEQ ID NO-49B 
FIGURE 49B 

DTDI A r^lTlVf CUFFT n>fTT IT *76\ 



0 



169/185 



1081/361 1111/371 

tta cgt get get ccc egg ccg teg gat gcc gat tec gca get tec egg tgc gac ggc tgg 
leu arg ala ala pro arg pro ser asp ala asp ser ala ala ser arg cys asp gly trp 
1141/381 1171/391 

cgc teg gag cac gga cat cga gaa etc teg ggg ttc ggc gaa cgt tat etc agt gga ate 
arg ser glu his gly his arg glu leu ser gly phe gly glu arg tyr leu ser gly ile 
1201/401 1231/411 

tea gtc cac gcg cgc aae eta gtt gtg cag tta ctg ttg aaa gcc aca ccc atg cca gtc 
ser val his ala arg asn leu val val gin leu leu leu lys ala thr pro met pro val 
1261/421 1291/431 

cac gca tgg cca agt tgg ccc gag tag tgg gcc tag tac agg aag age aae eta gcg aca 
his ala trp pro ser trp pro glu AMB trp ala AMB tyr arg lys ser asn leu ala thr 
1321/441 1351/451 

tga cga ate acc cac ggt att cgc cac cgc cgc age age egg gaa ccc cag gtt atg etc 
OPA arg ile thr his gly ile arg his arg arg ser ser arg glu pro gin val met leu 
1381/461 1411/471 

agg ggc age age aaa cgt aca gcc age agt teg act ggc gtt acc cac cgt ccc, cgc ccc 
arg gly ser ser lys arg thr ala ser ser ser thr gly val thr his arg pro arg pro 
1441/481 1471/491 

cgc age caa ccc agt acc gtc aac cct acg agg cgt tgg gtg gta ccc ggc egg gtc tga 
arg ser gin pro ser thr val asn pro thr arg arg trp val val pro gly arg val OPA 
1501/501 1531/511 

tac ctg gcg tga ttc cga cca tga cgc ccc etc ctg gga tgg ttc gcc aac gcc etc gtg 
tyr leu ala OPA phe arg pro OPA arg pro leu leu gly trp phe ala asn ala leu val 
1561/521 1591/531 

cag gca tgt tgg cca teg gcg egg tga cga tag egg tgg tgt ccg ccg gca teg gcg gcg 
gin ala cys trp pro ser ala arg OPA arg AMB arg trp cys pro pro ala ser ala ala 
1621/541 1651/551 

egg ccg cat ccc tgg teg ggt tea acc ggg cac ccg ccg gcc cca gcg gcg gcc cag tgg 
arg pro his pro trp ser gly ser thr gly his pro pro ala pro ala ala ala gin trp 
1681/561 1711/571 

ctg cca gcg egg cgc caa gca tec ccg cag caa aca tgc cgc egg ggt egg teg aac agg 
leu pro ala arg arg gin ala ser pro gin gin thr cys arg arg gly arg ser asn arg 
1741/581 1771/591 

tgg egg cca agg tgg tgc cca gtg teg tea tgt tgg aaa ccg ate tgg gcc gcc agt egg 
trp arg pro arg trp cys pro val ser ser cys trp lys pro ile trp ala ala ser arg 
1801/601 1831/611 

agg agg get ccg gca tea ttc tgt ctg ccg agg ggc tga tct tga cca aca acc acg tga 
arg arg ala pro ala ser phe cys leu pro arg gly OPA ser OPA pro thr thr thr OPA 
1861/621 1891/631 

teg egg egg ccg cca age etc ccc tgg gca gtc cgc cgc cga aaa cga egg ta 
ser arg arg pro pro ser leu pro trp ala val arg arg arg lys arg arg 



SEQ ID No.4 9B (continued 1) 
FIGURE 49B (continued 1) 




170/185 



1/1 31/11 



ate tac acc get caa cag cog ggc cag acg ctg cog gtc ggt get gee gag aag geg gtg 
ile tyr thr ala gin gin pro gly gin thr Ij^P" val 9 lv ala ala glu lyS ala Val 



6 li 2 l^ «<,c- aaa tta ttc atg teg egg cgc acc acc gec gac caa egg gtg ctt gee ate 
IS afg g!y glu lei phe -? se? arg arg thr^thr a!a asp gin arg val leu ala ile 

cat'cJq acc aac ggt agt teg ctg ctg ate tec aaa agt etc aag ccc acc gaa gca gtc 
arg leu thr asn fly sir ser leu leu ile ser^lys ser leu lys pro thr glu ala val 

ata'fac aag ctg cgt tgg gtg eta ttg ate gtg ggt ggg ate ggg gtg geg gtc gec geg 
ill asn lys leu arg trp val leu leu ile vainly gly ile gly val ala val ala ala 

Vtl'llc ggg ggg atg gtc acc egg gec ggg ctg agg ccg gtg ggc cgc etc acc gaa geg 
ill Tit gly HI me? val thr arg ala gly leu arg pro val gly arg leu thr glu ala 

aee^ag egg gtg geg ega ace gae gac ctg egg cee ate ccc gtc ttc ggc age gae gaa 
La glu arg S all arg thr asp asp leu arg^ro ile pro val phe gly ser asp glu 

III' III aaa ctg aca gag gca ttc aat tta atg ctg egg geg ctg gec gag tea egg gaa 
ill ala arg ill thr glu ala P he asn leu met^leu arg ala leu ala glu ser arg glu 

caa 7 cag gca agg ctg gtt acc gac gec gga cat gaa ttg cgt acc ccg eta acg teg ctg 
arg gin Tit arg leu val thr Lp ala gly his glu leu arg thr pro leu thr ser leu 

SS a S S £ S 2 K 2 2 = - - - - - s - - 

a a a a a a a s a a as a a a a a a a a 

f;m /9m 631/211 

gat ttg gtg gae ctg tee ega ggc gac gec gga gaa gtg gtg cac gag ccg gtc 
val gly asp Jeu vaJ Lp leu ser arg gly asp ala gly glu val val hxs glu pro val 
^T/^l 691/231 

aac atq get gac gtc gtc gac cgc age ctg gag egg gtc agg egg egg cgc aac gat ate 
Kp me? La Lp val val Lp arg ser leu glu arg val arg arg arg arg asn asp He 

751/251 

ct^tic aac gtc gag gtg att ggg tgg cag gtt tat ggc gat acc get gga ttg teg egg 
leu phe Lp Hi 111 U ile gTy trp gin val^tyr gly asp thr ala gly leu ser arg 

Itq'qcg ctt aac ctg atg gac aac gec geg aag tgg age ccg ccg ggc ggc cac gtg ggt 
111 Til Hu asn leu me? Lp asn ala ala lys trp ser pro pro gly gly hi. val gly 



871/291 



a a a a a a a a a a ss a a a a a a a a 

901/301 931/311 



is a a a a a a s a a a a a a a a a a a a 
3s a a a a a a a a a a a a a a a a a a 



SEQ ID No.49C 
FIGURE 49C 



# 



171/185 



1021/341 

ggc gga ttg ctg cgc 
gly gly leu leu arg 
1081/361 

tac gtg ctg etc ccc 
tyr val leu leu pro 
1141/381 

get egg age acg gac 
ala arg ser thr asp 
1201/401 

cag tec acg cgc gca 
gin ser thr arg ala 
1261/421 

acg cat ggc caa gtt 
thr his gly gin val 
1321/441 

gac gaa tea ccc acg 
asp glu ser pro thr 
1381/461 

ggg gca gca gca aac 
gly ala ala ala asn 
1441/481 

gca gec aac cca gta 
ala ala asn pro val 
1501/501 

acc tgg cgt gat tec 
thr trp arg asp ser 
1561/521 

agg cat gtt ggc cat 
arg his val gly his 
1621/541 

ggc cgc ate cct ggt 
gly arg ile pro gly 
1681/561 

tgc cag cgc ggc gec 
cys gin arg gly ala 
1741/581 

ggc ggc caa ggt ggt 
gly gly gin gly gly 
1801/601 

gga ggg etc egg cat 
gly gly leu arg his 
1861/621 

cgc ggc ggc cgc caa 
arg gly gly arg gin 



ate gaa gac acc gac 
ile glu asp thr asp 

ggc cgt egg atg ccg 
gly arg arg met pro 

ate gag aac tct egg 
ile glu asn ser arg 

acc tag ttg tgc agt 
thr AMB leu cys ser 

ggc ccg agt agt ggg 
gly pro ser ser gly 

gta ttc- gec acc gee 
val. phe ala thr ala 

gta cag cca gca gtt 
val gin pro ala val 

ccg tea acc eta cga 
pro ser thr leu arg 

gac cat gac gee ccc 
asp his asp ala pro 

egg cgc ggt gac gat 
arg arg gly asp asp 

egg gtt caa ccg ggc 
arg val gin pro gly 

aag cat ccc cgc age 
lys his pro arg ser 

gee cag tgt cgt cat 
ala gin cys arg his 

cat tct gtc tgc cga 
his ser val cys arg 

gec tec cct ggg cag 
ala ser pro gly gin 



1051/351 

cca ggc ggc cag ccc 
pro gly gly gin pro 
1111/371 

att ccg cag ctt ccc 
ile pro gin leu pro 
1171/391 

ggt teg gcg aac gtt 
gly ser ala asn val 
1231/411 

tac tgt tga aag cca 
tyr cys OPA lys pro 
1291/431 

cct agt aca gga aga 
pro ser thr gly arg 
1351/451 

gca gca gee ggg aac 
ala ala ala gly asn 
1411/471 

cga ctg gcg tta ccc 
arg leu ala leu pro 
1471/491 

ggc gtt ggg tgg tac 
gly val gly trp tyr 
1531/511 

tec tgg gat ggt teg 
ser trp asp gly ser 
1591/531 

age ggt ggt gtc cgc 
ser gly gly val arg 
1651/551 

acc cgc egg ccc cag 
thr arg arg pro gin 
1711/571 

aaa cat gee gee ggg 
lys his ala ala gly 
1771/591 

gtt gga aac cga tct 
val gly asn arg ser 
1831/611 

ggg g ct 9 at ctt 9 ac 

gly ala asp leu asp 
1891/631 

tec gee gee gaa aac 
ser ala ala glu asn 



cct gga acg teg att 
pro gly thr ser ile 

ggt gcg acg gct ggc 
gly ala thr ala gly 

ate tea gtg gaa tct 
ile ser val glu ser 

cac cca tgc cag tec 
his pro cys gin ser 

gca acc tag cga cat 
ala thr AMB arg his 

ccc agg tta tgc tea 
pro arg leu cys ser 

acc gtc ccc gee ccc 
thr val pro ala pro 

ccg gee ggg tct gat 
pro ala gly ser asp 

cca acg ccc teg tgc 
pro thr pro ser cys 

egg cat egg egg cgc 
arg his arg arg arg 

egg egg ccc agt ggc 
arg arg pro ser gly 

gtc ggt cga aca ggt 
val gly arg thr gly 

ggg ccg cca gtc gga 
gly pro pro val gly 

caa caa cca cgt gat 
gin gin pro arg asp 

gac ggt a 
asp gly 



SEQ ID No.49C (continued 1) 
FIGURE 4 9C (continued 1) 



# 



172/185 



Coding sequence Rv0983 predicted by Cole et al . , 1998 (Nature 393:537-544) and 
containing seq60A: 

1/1 31/11 

atg gcc aag ttg gcc cga gta gtg ggc eta gta cag gaa gag caa cct age gac atg acg 
Met ala lys leu ala arg val val gly leu val gin glu glu gin pro ser asp met thr 
61/21 91/31 

aat cac cca egg tat teg cca ccg ccg cag cag ccg gga acc cca ggt tat get cag ggg 
asn his pro arg tyr ser pro pro pro gin gin pro gly thr pro gly tyr ala gin gly 
121/41 151/51 

cag cag caa acg tac age cag cag ttc gac tgg cgt tac cca ccg tec ccg ccc ccg cag 
aln qln gin thr tyr ser gin gin phe asp trp arg tyr pro pro ser pro pro pro gin 
181/61 211/71 

cca acc cag tac cgt caa ccc tac gag gcg ttg ggt ggt acc egg ccg ggt ctg ata cct 
pro thr gin tyr arg gin pro tyr glu ala leu gly gly thr arg pro gly leu ile pro 
241/81 271 ' 91 

ggc gtg att ccg acc atg acg ccc cct cct ggg atg gtt cgc caa cgc cct cgt gca ggc 
gly val ile pro thr met thr pro pro pro gly met val arg gin arg pro arg ala gly 
301/101 331/111 

atg ttg gcc ate ggc gcg gtg acg ata gcg gtg gtg tec gcc ggc ate ggc ggc gcg gcc 
met leu ala ile gly ala val thr ile ala val val ser ala gly ile gly gly ala ala 
361/121 391/131 

gca tec ctg gtc ggg ttc aac egg gca ccc gcc ggc ccc age ggc ggc cca gtg get gcc 
ala ser leu val gly phe. asn arg ala pro ala gly pro ser gly gly pro val ala ala 
421/141 451/151 

age gcg gcg cca age ate ccc gca gca aac atg ccg ccg ggg teg gtc gaa cag gtg gcg 
ser ala ala pro ser ile pro ala ala asn met pro pro gly ser val glu gin val ala 
481/161 511/171 

gcc aag gtg gtg ccc agt gtc gtc atg ttg gaa acc gat ctg ggc cgc cag teg gag gag 
ala lys val val pro ser val val met leu glu thr asp leu gly arg gin ser glu glu 
541/181 571/191 

ggc tec ggc ate att ctg tct gcc gag ggg ctg ate ttg acc aac aac cac gtg ate gcg 
gly ser gly ile ile leu ser ala glu gly leu ile leu thr asn asn his val lie ala 
601/201 631/211 

gcg gcc gcc aag cct ccc ctg ggc agt ccg ccg ccg aaa acg acg gta acc ttc tct gac 
ala ala ala lys pro pro leu gly ser pro pro pro lys thr thr val thr phe ser asp 
661/221 691/231 

ggg egg acc gca ccc ttc acg gtg gtg ggg get gac ccc acc agt gat ate gcc gtc gtc 
gly arg thr ala pro phe thr val val gly ala asp pro thr ser asp ile ala val val 
721/241 751/251 

cgt gtt cag ggc gtc tec ggg etc acc ccg ate tec ctg ggt tec tec teg gac ctg agg 
arg val gin gly val ser gly leu thr pro ile ser leu gly ser ser ser asp leu arg 
781/261 811/271 

gtc ggt cag ccg gtg ctg gcg ate ggg teg ccg etc ggt ttg gag ggc acc gtg acc acg 
val gly gin pro val leu ala ile gly ser pro leu gly leu glu gly thr val thr thr 
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841/281 

ggg ate gtc age get etc aac cgt cca gtg 
gly ile val ser ala leu asn arg pro val 
901/301 

acc gtg ctg gac gec att cag ace gac gec 
thr val leu asp ala ile gin thr asp ala 
961/321 

ctg gtg aac atg aac get caa etc gtc gga 
leu val asn met asn ala gin leu val gly 
1021/341 

gac tea gec gat gcg cag age ggc teg ate 
asp ser ala asp ala gin ser gly ser ile 
1081/361 

gec aag cgc ate gee gac gag ttg ate age 
ala lys arg ile ala asp glu leu ile ser 
1141/381 

gtg cag gtg acc aat gac aaa gac acc ctg 
val gin val thr asn asp lys asp thr leu 
1201/401 

ggt get gec gcg aac get gga gtg ccg aag 
gly ala ala ala asn ala gly val pro lys 
1261/421 

ccg ate aac age gcg gac gcg ttg gtt gee 
pro ile asn ser ala asp ala leu val ala 
1321/441 

gtg gcg eta acc ttt cag gat ccc teg ggc 
val ala leu thr phe gin asp pro ser gly 
1381/461 

aag gcg gag cag tga 
" lys ala glu gin OPA 



871/291 

teg acg acc ggc gag gee ggc aac cag aac 
ser thr thr gly glu ala gly asn gin asn 
931/311 

gcg ate aac ccc ggt aac tec ggg ggc gcg 
ala ile asn pro gly asn ser gly gly ala 
991/331 

gtc aac teg gec att gee acg ctg ggc gcg 
val asn ser ala ile ala thr leu gly ala 
1051/351 

ggt etc ggt ttt gcg att cca gtc gac cag 
gly leu gly phe ala ile pro val asp gin 
1111/371 

acc ggc aag gcg tea cat gec tec ctg ggt 
thr gly lys ala ser his ala ser leu gly 
1171/391 

ggc gee aag ate gtc gaa gta gtg gee ggt 
gly ala lys ile val glu val val ala gly 
1231/411 

ggc gtc gtt gtc acc aag gtc gac gac cgc 
gly val val val thr lys val asp asp arg 
1291/431 

gee gtg egg tec aaa gcg ccg ggc gee acg 
ala val arg ser lys ala pro gly ala thr 
1351/451 

ggt age cgc aca gtg caa gtc acc etc ggc 
gly ser arg thr val gin val thr leu gly 
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ORF according to Cole et al . , 1998 (Natu 
1/1 

tga gcc age teg acg cgt cgc acg ctg age 
OPA ala ser ser thr arg arg thr leu ser 
61/21 

ccg tgc agg age gcc gtc tgg tgt ttg aac 
pro cys arg ser ala val trp cys leu asn 
121/41 

tgc egg gtt egg gcc teg ggt tgg cga teg 
cys arg val arg ala ser gly trp arg ser 
181/61 

tgc tgc gca teg aag aca ccg ace cag gcg 
cys cys ala ser lys thr pro thr gin ala 
241/81 

tgc tec ccg gcc gtc gga tgc cga ttc cgc 
cys ser pro ala val gly cys arg phe arg 
301/101 

gca egg aca teg aga act etc ggg gtt egg 
ala arg thr ser arg thr leu gly val arg 
361/121 

cgc gcg caa cct agt tgt gca gtt act gtt 
arg ala gin pro ser cys ala val thr val 
421/141 

gcc aag ttg gcc cga gta gtg ggc eta gta 
ala lys leu ala arg val val gly leu val 
481/161 

cac cca egg tat teg cca ccg ccg cag cag 
his pro arg tyr ser pro pro pro gin gin 
541/181 

cag caa acg tac age cag cag ttc gac tgg 
gin gin thr tyr ser gin gin phe asp trp 
601/201 

acc cag tac cgt caa ccc tac gag gcg ttg 
thr gin tyr arg gin pro tyr glu ala leu 
661/221 

gtg att ccg acc atg acg ccc cct cct ggg 
val ile pro thr met thr pro pro pro gly 
721/241 

ttg gcc ate ggc gcg gtg acg ata gcg gtg 
leu ala ile gly ala val thr ile ala val 
781/261 

tec ctg gtc ggg ttc aac egg gca ccc gcc 
ser leu val gly phe asn arg ala pro ala 
841/281 

gcg gcg cca age ate ccc gca gca aac atg 
ala ala pro ser ile pro ala ala asn met 



re 393:537-544) and containing Rv0983 
31/11 

tgg tgg ttt ccg acc gcg gcc egg gca ttc 
trp trp phe pro thr ala ala arg ala phe 
91/31 

ggt ttt acc ggt egg cat egg cac ggg cgt 
gly phe thr gly arg his arg his gly arg 
151/51 

tea aac agg tgg tgc tea acc acg gcg gat 
ser asn arg trp cys ser thr thr ala asp 
211/71 

gcc age ccc ctg gaa cgt cga ttt acg tgc 
ala ser pro leu glu arg arg phe thr cys 
271/91 

age ttc ccg gtg cga egg ctg gcg etc gga 
ser phe pro val arg arg leu ala leu gly 
331/111 

cga acg tta tct cag tgg aat etc agt cca 
arg thr leu ser gin trp asn leu ser pro 
391/131 

gaa age cac acc cat gcc agt cca cgc atg 
glu ser his thr his ala ser pro arg met 
451/151 

cag gaa gag caa cct age gac atg acg aat 
gin glu glu gin pro ser asp met thr asn 
511/171 

ccg gga acc cca ggt tat get cag ggg cag 
pro gly thr pro gly tyr ala gin gly gin 
571/191 

cgt tac cca ccg tec ccg ccc ccg cag cca 
arg tyr pro pro ser pro pro pro gin pro 
631/211 

ggt ggt acc egg ccg ggt ctg ata cct ggc 
gly gly thr arg pro gly leu ile pro gly 
691/231 

atg gtt cgc caa cgc cct cgt gca ggc atg 
met val arg gin arg pro arg ala gly met 
751/251 

gtg tec gcc ggc ate ggc ggc gcg gcc gca 
val ser ala gly ile gly gly ala ala ala 
811/271 

ggc ccc age ggc gge cca gtg get gcc age 
gly pro ser gly gly pro val ala ala ser 
871/291 

ccg ccg ggg teg gtc gaa cag gtg gcg gcc 
pro pro gly ser val glu gin val ala ala 
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901/301 931/311 

aag gtg gtg ccc agt gtc gtc atg ttg gaa acc gat ctg ggc cgc cag teg gag gag ggc 
lys val val pro ser val val met leu glu thr asp leu gly arg gin ser glu glu gly 
961/321 991/331 

tec ggc ate att ctg tct gec gag ggg ctg ate ttg acc aac aac cac gtg ate gcg gcg 
ser gly ile ile leu ser ala glu gly leu ile leu thr asn asn his val lie ala ala 
1021/341 1051/351 

gee gec aag cct ccc ctg ggc agt ccg ccg ccg aaa acg acg gta acc ttc tct gac ggg 
ala ala lys pro pro leu gly ser pro pro pro lys thr thr val thr phe ser asp gly 
1081/361 1111/371 

egg acc gca ccc ttc acg gtg gtg ggg get gac ccc acc agt gat ate gee gtc gtc cgt 
ara thr ala pro phe thr val val gly ala asp pro thr ser asp ile ala val val arg 
1141/381 1171/391 

gtt cag ggc gtc tec ggg etc acc ccg ate tec ctg ggt tec tec teg gac ctg agg gtc 
val gin gly val ser gly leu thr pro ile ser leu gly ser ser ser asp leu arg val 
1201/401 1231/411 

ggt cag ccg gtg ctg gcg ate ggg teg ccg etc ggt ttg gag ggc acc gtg acc acg ggg 
gly gin pro val leu ala ile gly ser pro leu gly leu glu gly thr val thr thr gly 
1261/421 1291/431 

ate gtc age get etc aac cgt cca gtg teg acg acc ggc gag gec ggc aac cag aac acc 
ile val ser ala leu asn arg pro val ser thr thr gly glu ala gly asn gin asn thr 
1321/441 1351/451 

gtg ctg gac gec att cag acc gac gec gcg ate aac ccc ggt aac tec ggg ggc gcg ctg 
val leu asp ala ile gin thr asp ala ala ile asn pro gly asn ser gly gly ala leu 
1381/461 1411/471 

gtg aac atg aac get caa etc gtc gga gtc aac teg gec att gec acg ctg ggc gcg gac 
val asn met asn ala gin leu val gly val asn ser ala ile ala thr leu gly ala asp 
1441/481 1471/491 

tea gee gat gcg cag age ggc teg ate ggt etc ggt ttt gcg att cca gtc gac cag gee 
ser ala asp ala gin ser gly ser ile gly leu gly phe ala ile pro val asp gin ala 
1501/501 1531/511 

aag cgc ate gec gac gag ttg ate age acc ggc aag gcg tea cat gec tec ctg ggt gtg 
lys arg ile ala asp glu leu ile ser thr gly lys ala ser his ala ser leu gly val 
1561/521 1591/531 

cag gtg acc aat gac aaa gac acc ctg ggc gee aag ate gtc gaa gta gtg gec ggt ggt 
gin val thr asn asp lys asp thr leu gly ala lys ile val glu val val ala gly gly 
1621/541 1651/551 

get gec gcg aac get gga gtg ccg aag ggc gtc gtt gtc acc aag gtc gac gac cgc ccg 
ala ala ala asn ala gly val pro lys gly val val val thr lys val asp asp arg pro 
1681/561 1711/571 

ate aac age gcg gac gcg ttg gtt gee gee gtg egg tec aaa gcg ccg ggc gec acg gtg 
ile asn ser ala asp ala leu val ala ala val arg ser lys ala pro gly ala thr val 
1741/581 1771/591 

gcg eta acc ttt cag gat ccc teg ggc ggt age cgc aca gtg caa gtc acc etc ggc aag 

ala leu thr phe gin asp pro ser gly gly ser arg thr val gin val thr leu gly lys 

1801/601 

gcg gag cag tga 

ala glu gin OPA 
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Fragment amplified by PCR based on the sequence similarities with a serine protease of 
the E.coli HtrA family (creation of an SnaBI site at the 3' end) and subcloned into the 
vector pJVEDa: 

1/1 31/11 

gat ccg gcg ggg egg gtg teg gcg cag gcg tgg ctg gcg gtc acg gcg gtg egg gcg gtg 
asp pro ala gly arg val ser ala gin ala trp leu ala val thr ala val arg ala val 
61/21 91/31 

ccg ccg ggc tgt ggg gcg ccg gcg gcg gcg gtg gca atg gcg gga acg gcg ccg atg cca 
pro pro gly cys gly ala pro ala ala ala val ala met ala gly thr ala pro met pro 
121/41 151/51 

aca teg tea gcg gtg gag acg gtg gec teg gcg gtg ccg gtg gcg gtg gcg gat ggc tct 
thr ser ser ala val glu thr val ala ser ala val pro val ala val ala asp gly ser 
181/61 211/71 

acg gcg acg gcg ggg ccg gcg gac acg gcg gac aag gcg caa teg gec teg gcg gcg gcg 
thr ala thr ala gly pro ala asp thr ala asp lys ala gin ser ala ser ala ala ala 
241/81 271/91 

ccg gcg gcg acg ggg gec agg gcg gcg ccg gec gcg gac tgt ggg gta ctg gcg gcg ccg 
pro ala ala thr gly ala arg ala ala pro ala ala asp cys gly val leu ala ala pro 
301/101 331/111 

gcg gac acg gcg ggc aag gcg gtg gta ccg ggg gee cac cgc tgc ccg gtc agg cag gca 
ala asp thr ala gly lys ala val val pro gly ala his arg cys pro val arg gin ala 
361/121 391/131 

tgg gcg ccg egg gtg gcg ccg gtg ggc tga teg gca acg gcg ggg ccg gcg gcg acg gcg 
trp ala pro arg val ala pro val gly OPA ser ala thr ala gly pro ala ala thr ala 
421/141 451/151 

gtg teg gcg cgt ccg gcg ggg teg ccg gag tag gcg gtg ccg gcg gga acg cca tgc tga 
val ser ala arg pro ala gly ser pro glu AMB ala val pro ala gly thr pro cys OPA 
481/161 511/171 

teg ggc acg gcg gcg ccg gcg gcg ccg gcg gag aca gca gtt teg eta atg gcg egg ccg 
ser gly thr ala ala pro ala ala pro ala glu thr ala val ser leu met ala arg pro 
541/181 571/191 

gcg gcg egg gcg gtg ccg gag ggc ace tct teg gca atg gcg ggt ccg gcg gee acg gcg 
ala ala arg ala val pro glu gly thr ser ser ala met ala gly pro ala ala thr ala 
601/201 631/211 

gag ccg tea egg ccg gca aca ccg gta teg gtg gcg ccg gcg gcg teg gtg ggg acg cca 
glu pro ser arg pro ala thr pro val ser val ala pro ala ala ser val gly thr pro 
661/221 691/231 

ggc tga teg gee acg gtg gcg ccg gcg gtg ccg gcg ggg ace gcg ccg gag cct tgg ttg 
gly OPA ser aia thr val ala pro ala val pro ala gly thr ala pro glu pro trp leu 
721/241 751/251 

gee gtg acg gcg ggc ccg gtg gga acg ggg gcg ctg gcg gec age tat acg gca acg gcg 
ala val thr ala gly pro val gly thr gly ala leu ala ala ser tyr thr ala thr ala 
781/261 811/271 

gcg acg gcg ccc ccg gca ccg gcg gaa cac tgc agg egg egg tga gcg gat tgg tga egg 
ala thr ala pro pro ala pro ala glu his cys arg arg arg OPA ala asp trp OPA arg 
841/281 871/291 

ctt tgt teg gtg cac ccg gee aac ccg gcg aca ccg gee aac ccg get age ccc gat caa 
leu cys ser val his pro ala asn pro ala thr pro ala asn pro ala ser pro asp gin 
901/301 931/311 

cga ggg ttt egg tgc egg tec ggg gca tgg cca tec get gag ctg gcg ate tgg act acg 
arg gly phe arg cys arg ser gly ala trp pro ser ala glu leu ala ile trp thr thr 
961/321 991/331 

ttg gtg tag aaa aat cct gee gec egg ace ctt aag get ggg aca att tct gat age tac 
leu val AMB lys asn pro ala ala arg thr leu lys ala gly thr ile ser asp ser tyr 
1021/341 1051/351 

ccc gac aca gga ggt tac ggg atg age aat teg cgc cgc cgc tea etc agg tgg tea tgg 
pro asp thr gly gly tyr gly met ser asn ser arg arg arg ser leu arg trp ser trp 
1081/361 1111/371 

ttg ctg age gtg ctg get gee gtc ggg ctg ggc ctg gec acg gcg ccg gee cag gcg gee 
leu leu ser val leu ala ala val gly leu gly leu ala thr ala pro ala gin ala ala 
1141/381 

ccg ccg gee ttg teg cag gac egg tt 
pro pro ala leu ser gin asp arg 

SEQ ID NO.50A 
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1/1 31/11 

ate egg egg ggc ggg tgt egg cgc agg cgt ggc tgg egg tea egg egg tgc ggg egg tgc 
ile arg arg gly gly cys arg arg arg arg gly trp arg ser arg arg cys gly arg cys 
61/21 91/31 

cgc egg get gtg ggg cgc egg egg egg egg tgg caa tgg egg gaa egg cgc cga tgc caa 
arg arg ala val gly arg arg arg arg arg trp gin trp arg glu arg arg arg cys gin 
121/41 151/51 

cat cgt cag egg tgg aga egg tgg cct egg egg tgc egg tgg egg tgg egg atg get eta 
his arg gin arg trp arg arg trp pro arg arg cys arg trp arg trp arg met ala leu 
181/61 211/71 

egg cga egg egg ggc egg egg aca egg egg aca agg cgc aat egg cct egg egg egg cgc 
arg arg arg arg gly arg arg thr arg arg thr arg arg asn arg pro arg arg arg arg 
241/81 271/91 

egg egg cga egg ggg cca ggg egg cgc egg ccg egg act gtg ggg tac tgg egg cgc egg 
arg arg arg arg gly pro gly arg arg arg pro arg thr val gly tyr trp arg arg arg 
301/101 331/111 

egg aca egg egg gca agg egg tgg tac egg ggg ccc ace get gee egg tea ggc agg cat 
arg thr arg arg ala arg arg trp tyr arg gly pro thr ala ala arg ser gly arg his 
361/121 391/131 

ggg cgc cgc ggg tgg cgc egg tgg get gat egg caa egg egg ggc egg egg cga egg egg 
gly arg arg gly trp arg arg trp ala asp arg gin arg arg gly arg arg arg arg arg 
421/141 451/151 

tgt egg cgc gtc egg egg ggt cgc egg agt agg egg tgc egg egg gaa cgc cat get gat 
cys arg arg val arg arg gly arg arg ser arg arg cys arg arg glu arg his ala asp 
481/161 511/171 

egg gca egg egg cgc egg egg cgc egg egg aga cag cag ttt cgc taa tgg cgc ggc egg 
arg ala arg arg arg arg arg arg arg arg arg gin gin phe arg OCH trp arg gly arg 
541/181 571/191 

egg cgc ggg egg tgc egg agg gca cct ctt egg caa tgg egg gtc egg egg cca egg egg 
arg arg gly arg cys arg arg ala pro leu arg gin trp arg val arg arg pro arg arg 
601/201 631/211 

age cgt cac ggc egg caa cac egg tat egg tgg cgc egg egg cgt egg tgg gga cgc cag 
ser arg his gly arg gin his arg tyr arg trp arg arg arg arg arg trp gly arg gin 
661/221 691/231 

get gat egg cca egg tgg cgc egg egg tgc egg egg gga ccg cgc egg age ctt ggt tgg 
ala asp arg pro arg trp arg arg arg cys arg arg gly pro arg arg ser leu gly trp 
721/241 751/251 

ccg tga egg egg gec egg tgg gaa egg ggg cgc tgg egg cca get ata egg caa egg egg 
pro OPA arg arg ala arg trp glu arg gly arg trp arg pro ala ile arg gin arg arg 
781/261 811/271 

cga egg cgc ccc egg cac egg egg aac act gca ggc ggc ggt gag egg att ggt gac ggc 
arg arg arg pro arg his arg arg asn thr ala gly gly gly glu arg ile gly asp gly 
841/281 871/291 

ttt gtt egg tgc acc egg cca ace egg cga cac egg cca ace egg eta gee ccg ate aac 
phe val arg cys thr arg pro thr arg arg his arg pro thr arg leu ala pro ile asn 
901/301 931/311 

gag ggt ttc ggt gee ggt ccg ggg cat ggc cat ccg ctg age tgg cga tct gga eta cgt 
glu gly phe gly ala gly pro gly his gly his pro leu ser trp arg ser gly leu arg 
961/321 991/331 

tgg tgt aga aaa ate ctg ccg ccc gga ccc tta agg ctg gga caa ttt ctg ata get acc 
trp cys arg lys ile leu pro pro gly pro leu arg leu gly gin phe leu ile ala thr 
1021/341 1051/351 

ccg aca cag gag gtt acg gga tga gca att cgc gee gee get cac tea ggt ggt cat ggt 
pro thr gin glu val thr gly OPA ala ile arg ala ala ala his ser gly gly his gly 
1081/361 1111/371 

tgc tga gcg tgc tgg ctg ccg teg ggc tgg gee tgg cca egg cgc egg ccc agg egg ccc 
cys OPA ala cys trp leu pro ser gly trp ala trp pro arg arg arg pro arg arg pro 
1141/381 

cgc egg cct tgt cgc agg acc ggt t 
arg arg pro cys arg arg thr gly 

SEQ ID NO.50B 
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1/1 

tec ggc ggg gcg ggt gtc ggc gca ggc gtg 
ser gly gly ala gly val gly ala gly val 
61/21 

gec ggg ctg tgg ggc gec ggc ggc ggc ggt 
ala gly leu trp gly ala gly gly gly gly 
121/41 

ate gtc age ggt gga gac ggt ggc etc ggc 
ile val ser gly gly asp gly gly leu gly 
181/61 

ggc gac ggc ggg gec ggc gga cac ggc gga 
gly asp gly gly ala gly gly his gly gly 
241/81 

ggc ggc gac ggg ggc cag ggc ggc gee ggc 
gly gly asp gly gly gin gly gly ala gly 
301/101 

gga cac ggc ggg caa ggc ggt ggt acc ggg 
gly his gly gly gin gly gly gly thr gly 
361/121 

ggc gec gcg ggt ggc gec ggt ggg ctg ate 
gly ala ala gly gly ala gly gly leu ile 
421/141 

gtc ggc gcg tec ggc ggg gtc gec gga gta 
val gly ala ser gly gly val ala gly val 
481/161 

ggg cac ggc ggc gee ggc ggc gec ggc gga 
gly his gly gly ala gly gly ala gly gly 
541/181 

ggc gcg ggc ggt gec gga ggg cac etc ttc 
gly ala gly gly ala gly gly his leu phe 
601/201 

gee gtc acg gee ggc aac acc ggt ate ggt 
ala val thr ala gly asn thr gly ile gly 
661/221 

ctg ate ggc cac ggt ggc gee ggc ggt gec 
leu ile gly his gly gly ala gly gly ala 
721/241 

cgt gac ggc ggg ccc ggt ggg aac ggg ggc 
arg asp gly gly pro gly gly asn gly gly 
781/261 

gac ggc gec ccc ggc acc ggc gga aca ctg 
asp gly ala pro gly thr gly gly thr leu 
841/281 

ttg ttc ggt gca ccc ggc caa ccc ggc gac 
leu phe gly ala pro gly gin pro gly asp 
901/301 

agg gtt teg gtg ccg gtc egg ggc atg gee 
arg val ser val pro val arg gly met ala 
961/321 

ggt gta gaa aaa tec tgc cgc ccg gac cct 
gly val glu lys ser cys arg pro asp pro 
1021/341 

cga cac agg agg tta egg gat gag caa ttc 
arg his arg arg leu arg asp glu gin phe 
1081/361 

get gag cgt get ggc tgc cgt egg get ggg 
ala glu arg ala gly cys arg arg ala gly 
11*41/381 

gee ggc ctt gtc gca gga ccg gtt 
ala gly leu val ala gly pro val 



31/11 

get ggc ggt cac ggc ggt gcg ggc ggt gee 
ala gly gly his gly gly ala gly gly ala 
91/31 

ggc aat ggc ggg aac ggc gee gat gee aac 
gly asn gly gly asn gly ala asp ala asn 
151/51 

ggt gee ggt ggc ggt ggc gga tgg etc tac 
gly ala gly gly gly gly gly trp leu tyr 
211/71 

caa ggc gca ate ggc etc ggc ggc ggc gee 
gin gly ala ile gly leu gly gly gly ala 
271/91 

cgc gga ctg tgg ggt act ggc ggc gec ggc 
arg gly leu trp gly thr gly gly ala gly 
331/111 

ggc cca ccg ctg ccc ggt cag gca ggc atg 
gly pro pro leu pro gly gin ala gly met 
391/131 

ggc aac ggc ggg gee ggc ggc gac ggc ggt 
gly asn gly gly ala gly gly asp gly gly 
451/151 

ggc ggt gee ggc ggg aac gee atg ctg ate 
gly gly ala gly gly asn ala met leu ile 
511/171 

gac age agt ttc get aat ggc gcg gee ggc 
asp ser ser phe ala asn gly ala ala gly 
571/191 

ggc aat ggc ggg tec ggc ggc cac ggc gga 
gly asn gly gly ser gly gly his gly gly 
631/211 

ggc gee ggc ggc gtc ggt ggg gac gee agg 
gly ala gly gly val gly gly asp ala arg 
691/231 

ggc ggg gac cgc gee gga gee ttg gtt ggc 

giy giy as P ar g ala sly ala leu val 9iy 

751/251 

get ggc ggc cag eta tac ggc aac ggc ggc 
ala gly gly gin leu tyr gly asn gly gly 
811/271 

cag gcg gcg gtg age gga ttg gtg acg get 
gin ala ala val ser gly leu val thr ala 
871/291 

acc ggc caa ccc ggc tag ccc cga tea acg 
thr gly gin pro gly AMB pro arg ser thr 
931/311 

ate cgc tga get ggc gat ctg gac tac gtt 
ile arg OPA ala gly asp leu asp tyr val 
991/331 

taa ggc tgg gac aat ttc tga tag eta ccc 
OCH gly trp asp asn phe OPA AMB leu pro 
1051/351 

gcg ccg ccg etc act cag gtg gtc atg gtt 
ala pro pro leu thr gin val val met val 
1111/371 

cct ggc cac ggc gee ggc cca ggc ggc ccc 
pro gly his gly ala gly pro gly gly pro 
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179/185 

Coding sequence Rv0125 predicted by Cole et al., 1998 (Nature 393:537-544) and 
containing seq50A: 

1/1 31/11 

atg age aat teg cgc cgc cgc tea etc agg tgg tea tgg ttg ctg age gtg ctg get gcc 
Met ser asn ser arg arg arg ser leu arg trp ser trp leu leu ser val leu ala ala 
61/21 91/31 

gtc ggg ctg ggc ctg gcc acg gcg ccg gcc cag gcg gcc ccg ccg gcc ttg teg cag gac 
val gly leu gly leu ala thr ala pro ala gin ala ala pro pro ala leu ser gin asp 
121/41 151/51 

egg ttc gcc gac ttc ccc gcg ctg ccc etc gac ccg tec gcg atg gtc gcc caa gtg ggg 
ara phe ala asp phe pro ala leu pro leu asp pro ser ala met val ala gin val gly 
181/61 211/71 

cca cag gtg gtc aac ate aac acc aaa ctg ggc tac aac aac gcc gtg ggc gcc ggg acc 
pro gin val val asn ile asn thr lys leu gly tyr asn asn ala val gly ala gly thr 
241/81 271/91 

ggc ate gtc ate gat ccc aac ggt gtc gtg ctg acc aac aac cac gtg. ate gcg ggc gcc 
gly ile val ile asp pro asn gly val val leu thr asn asn his val ile ala gly ala 
301/101 331/111 

acc gac ate aat gcg ttc age gtc ggc tec ggc caa acc tac ggc gtc gat gtg gtc ggg 
thr asp ile asn ala phe ser val gly ser gly gin thr tyr gly val asp val val gly 
361/121 391/131 

tat gac cgc acc cag gat gtc gcg gtg ctg cag ctg cgc ggt gcc ggt ggc ctg ccg teg 
tvr asp arg thr gin asp val ala val leu gin leu arg gly ala gly gly leu pro ser 
421/141 451/151 

gcg gcg ate ggt ggc ggc gtc gcg gtt ggt gag ccc gtc gtc gcg atg ggc aac age ggt 
ala ala ile gly gly gly val ala val gly glu pro val val ala met gly asn ser gly 
481/161 511/171 

ggg cag ggc gga acg ccc cgt gcg gtg cct ggc agg gtg gtc gcg etc ggc caa acc gtg 
alv Qln gly gly thr pro arg ala val pro gly arg val val ala leu gly gin thr val 
541/181 571/191 

cag gcg teg gat teg ctg acc ggt gcc gaa gag aca ttg aac ggg ttg ate cag ttc gat 
gin ala ser asp ser leu thr gly ala glu glu thr leu asn gly leu ile gin phe asp 
601/201 631/211 

gcc gcg ate cag ccc ggt gat teg ggc ggg ccc gtc gtc aac ggc eta gga cag gtg gtc 
ala ala ile gin pro gly asp ser gly gly pro val val asn gly leu gly gin val val 
661/221 691/231 

ggt atg aac acg gcc gcg tec gat aac ttc cag ctg tec cag ggt ggg cag gga ttc gcc 
gly met asn thr ala ala ser asp asn phe gin leu ser gin gly gly gin gly phe ala 
721/241 751/251 

att ccg ate ggg cag gcg atg gcg ate gcg ggc cag ate cga teg ggt ggg ggg tea ccc 
ile pro ile gly gin ala met ala ile ala gly gin ile arg ser gly gly gly ser pro 
781/261 811/271 

acc gtt cat ate ggg cct acc gcc ttc etc ggc ttg ggt gtt gtc gac aac aac ggc aac 
thr val his ile gly pro thr ala phe leu gly leu gly val val asp asn asn gly asn 
841/281 871/291 

ggc gca cga gtc caa cgc gtg gtc ggg age get ccg gcg gca agt etc ggc ate tec acc 
qly ala arg val gin arg val val gly ser ala pro ala ala ser leu gly lie ser thr 
901/301 931/311 

ggc gac gtg ate acc gcg gtc gac ggc get ccg ate aac teg gcc acc gcg atg gcg gac 
qly asp val ile thr ala val asp gly ala pro ile asn ser ala thr ala met ala asp 
961/321 991/331 

gcg ctt aac ggg cat cat ccc ggt gac gtc ate teg gtg acc tgg caa acc aag teg ggc 
111 leu asn gly his his pro gly asp val ile ser val thr trp gin thr lys ser gly 
1021/341 1051/351 

ggc acg cgt aca ggg aac gtg aca ttg gcc gag gga ccc ccg gcc tga 
gly thr arg thr gly asn val thr leu ala glu gly pro pro ala OPA 
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180/185 

ORF according to Cole et al., 1998 (Nature 393:537-544) and containing Rv0125: 
1/1 31/11 

tag aaa aat cct gcc gcc egg acc ctt aag get ggg aca att tct gat age tac ccc gac 
AMB lys asn pro ala ala arg thr leu lys ala gly thr ile ser asp ser tyr pro asp 
61/21 91/31 

aca gga ggt tac ggg atg age aat teg cgc cgc cgc tea etc agg tgg tea tgg ttg ctg 
thr aly gly tyr gly met ser asn ser arg arg arg ser leu arg trp ser trp leu leu 
121/41 151/51 

age gtg ctg get gcc gtc ggg ctg ggc ctg gcc acg gcg ccg gcc cag gcg gcc ccg ccg 
ser val leu ala ala val gly leu gly leu ala thr ala pro ala gin ala ala pro pro 
181/61 211/71 

gcc ttg teg cag gac egg ttc gcc gac ttc ccc gcg ctg ccc etc gac ccg tec gcg atg 
ala leu ser gin asp arg phe ala asp phe pro ala leu pro leu asp pro ser ala met 
241/81 271/91 

gtc gcc caa gtg ggg cca cag gtg gtc aac ate aac acc aaa ctg ggc tac aac aac gcc 
val ala gin val gly pro gin val val asn ile asn thr lys leu gly tyr asn asn ala 
301/101 331/111 

gtg ggc gcc ggg acc ggc ate gtc ate gat ccc aac ggt gtc gtg ctg acc aac aac cac 
val gly ala gly thr gly ile val ile asp pro asn gly val val leu thr asn asn his 
361/121 391/131 

gtg ate gcg ggc gcc acc gac ate aat gcg ttc age gtc ggc tec ggc caa acc tac ggc 
val ile ala gly ala thr asp ile asn ala phe ser val gly ser gly gin thr tyr gly 
421/141 451/151 

gtc gat gtg gtc ggg tat gac cgc acc cag gat gtc gcg gtg ctg cag ctg cgc ggt gcc 
val asp val val gly tyr asp arg thr gin asp val ala val leu gin leu arg gly ala 
481/161 511/171 

ggt ggc ctg ccg teg gcg gcg ate ggt ggc ggc gtc gcg gtt ggt gag ccc gtc gtc gcg 
aly gly leu pro ser ala ala ile gly gly gly val ala val gly glu pro val val ala 
541/181 571/191 

atg ggc aac age ggt ggg cag ggc gga acg ccc cgt gcg gtg cct ggc agg gtg gtc gcg 
met gly asn ser gly gly gin gly gly thr pro arg ala val pro gly arg val val ala 
601/201 631/211 

etc ggc caa acc gtg cag gcg teg gat teg ctg acc ggt gcc gaa gag aca ttg aac ggg 
leu gly gin thr val gin ala ser asp ser leu thr gly ala glu glu thr leu asn gly 
661/221 691/231 

ttg ate cag ttc gat gcc gcg ate cag ccc ggt gat teg ggc ggg ccc gtc gtc aac ggc 
leu ile gin phe asp ala ala ile gin pro gly asp ser gly gly pro val val asn gly 
721/241 751/251 

eta gga cag gtg gtc ggt atg aac acg gee gcg tec gat aac ttc cag ctg tec cag ggt 
leu gly gin val val gly met asn thr ala ala ser asp asn phe gin leu ser gin gly 
781/261 811/271 

ggg cag gga ttc gcc att ccg ate ggg cag gcg atg gcg ate gcg ggc cag ate cga teg 
qly gin gly phe ala ile pro ile gly gin ala met ala ile ala gly gin ile arg ser 
841/281 871/291 

ggt ggg ggg tea ccc acc gtt cat ate ggg cct acc gcc ttc etc ggc ttg ggt gtt gtc 
gly gly gly ser pro thr val his ile gly pro thr ala phe leu gly leu gly val val 
901/301 931/311 

gac aac aac ggc aac ggc gca cga gtc caa cgc gtg gtc ggg age get ccg gcg gca agt 
asp asn asn gly asn gly ala arg val gin arg val val gly ser ala pro ala ala ser 
961/321 991/331 

etc ggc ate tec acc ggc gac gtg ate acc gcg gtc gac ggc get ccg ate aac teg gcc 
leu gly ile ser thr gly asp val ile thr ala val asp gly ala pro lie asn ser ala 
1021/341 1051/351 

acc gcg atg gcg gac gcg ctt aac ggg cat cat ccc ggt gac gtc ate teg gtg acc tgg 
thr ala met ala asp ala leu asn gly his his pro gly asp val ile ser val thr trp 
1081/361 1111/371 

caa acc aag teg ggc ggc acg cgt aca ggg aac gtg aca ttg gcc gag gga ccc ccg gcc 

gin thr lys ser gly gly thr arg thr gly asn val thr leu ala glu gly pro pro ala 

1141/381 

tga 

OPA 
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